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ADENO-ASSOCIATED VIRUS (AAV) CLADES. SEQUENCES. 
VECTORS CONTAINING SAME. AND USES THEREFOR 



STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR 
DEVELOPMENT 

This application contains work supported by grants NIDDK P30 DK47757 and 
NHLBI POl HL59407 from the National Institutes of Health (NIH). The US government 
may have certain rights in this invention. 

BACKGROUND OF THE INVENTION 

Adeno-associated virus (AAV), a member of the Parvovirus family, is a small 
nonenveloped, icosahedral virus with single-stranded linear DNA genomes of 4.7 
kilobases (kb) to 6 kb. AAV is assigned to the genus, Dependovirus, because the virus 
was discovered as a contaminant in purified adenovirus stocks. AAV's life cycle include 
a latent phase at which AAV genomes, after infection, are site specifically integrated into 
host chromosomes and an infectious phase in which, following either adenovirus or 
herpes simplex virus infection, the integrated genomes are subsequently rescued, 
replicated, and packaged into infectious viruses. The properties of non-pathogenicity, 
broad host range of infectivity, including non-dividing cells, and potential site-specific 
chromosomal integration make AAV an attractive tool for gene transfer. 

Recent studies suggest that AAV vectors may be the preferred vehicle for gene 
delivery. To date, there have been several different well-characterized AAVs isolated 
from human or non-human primates (NHP). 

It has been found that AAVs of different serotypes exhibit different transfection 
efficiencies, and also exhibit tropism for different cells or tissues. However, the 
relationship between these different serotypes has not previously been explored. 

What is desirable are AAV-based constructs for delivery of heterologous 
molecules. 
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SUMMARY OF THE INVENTION 

The present invention provides "superfamilies" or "clades" of AAV of 
phylogenetically related sequences. These AAV clades provide a source of AAV 
sequences useful for targeting and/or delivering molecules to desired target cells or 
5 tissues. 

In one aspect, the invention provides an AAV clade having at least three AAV 
members which are phylogenetically related as determined using a Neighbor-Joining 
heuristic by a bootstrap value of at least 75 % (based on at least 1 000 replicates) and a 
Poisson correction distance measurement of no more than 0.05, based on alignment of the 

10 AAV vpl amino acid sequence. Suitably, the AAV clade is composed of AAV sequences 
useful in generating vectors. 

The present invention further provides a human AAV serotype previously 
unknown, designated herein as clone 28.4/hu.l4, or alternatively, AAV serotype 9. Thus, 
in another aspect, the invention provides an AAV of serotype 9 composed of AAV capsid 

15 which is serologically related to a capsid of the sequence of amino acids I to 736 of SEQ 
ID NO: 123 and serologically distinct from a capsid protein of any of AAVl, AAV2, 
AAV3, AAV4, AAV5, AAV6, AAV7 or AAV8. 

Vectors constructed with capsid of this huAAV9 have exhibited gene transfer 
efficacies similar to AAV8 in liver, superior to AAVl in muscle and 200 fold higher than 

20 AAV 5 in lung. Further, this novel human AAV serotype shares less than 85% sequence 
identity to previously described AAVl through AAV8 and is not cross-neutralized by any 
of these AAVs. 

The present invention also provides other novel AAV sequences, compositions 
containing these sequences, and uses therefor. Advantageously, these compositions are 
25 particularly well suited for use in compositions requiring re-administration of AAV 
vectors for therapeutic or prophylactic purposes. 

These and other aspects of the invention will be readily apparent from the 
following detailed description of the invention. 

30 BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a tree showing the phylogenic relationship constructed using the 
Neighbor-Joining heuristic with Poisson correction distance measurement. The 
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relationship was determined based on the isolated AAV vpl capsid protein, with the 
isolated AAV grouped in clades. Groups of individual capsid clones are classified in 
clades based on their common ancestry. Clade nomenclature goes from A through F; 
subtypes are represented by the clade letter followed by a number. 
5 Figs. 2A-2AE are an alignment of the amino acid sequences of AAV vpl capsid 

proteins of the invention, with the numbering of the individual sequences reported, and 
previously published AAVl [SEQ ID NO: 219]; AAV2 [SEQ ID NO: 221]; AAV3-3 
[SEQ ID NO: 217]; AAV4-4 [SEQ ID NO: 218]; AAV5 [SEQ ID NO: 216]; AAV6 
[SEQ ID NO: 220]; AAV7 [SEQ ID NO: 222]; AAV8 [SEQ ID NO: 223], and; rh. 25/42- 
10 15; 29.3/bb. I ; cy.2; 29.5/bb.2; rh.32, rh.33, rh.34, rh. 1 0; rh.24; rh 1 4, rh. 1 6, rh. 1 7, rh. 1 2, 
rh.l8, rh.21 (formerly termed 41.10); rh.25 (formerly termed 41.1 5); rh2; rh.3l; cy.3; 
cy.5; rh.l3; cy.4; cy.6; rh.22; rh.l9; rh.35; rh.37; rh.36; rh.23; rh.8; and ch.5 [US 
Published Patent Application No. 2003/0138772 Al (Jul 24, 2003)]. The sequences of 
the invention include hu.l4/AAV9 [SEQ ID NO: 123]; hu.l7 [SEQ ID NO: 83 ], hu. 6 
15 [SEQ ID NO: 84 ], hu.42 [SEQ ID NO: 85], rh.38 [SEQ ID NO: 86], hu.40 [SEQ ID 
NO: 87], hu.37 [SEQ ID NO: 88 ], rh.40 [SEQ ID NO: 92], rh.52 [SEQ ID NO: 96]; 
rh.53 [SEQ ID NO: 97]; rh.49 [SEQ ID NO: 103];rh.51 [SEQ ID NO: 104];rh.57 [SEQ 
ID NO: 105 ]; rh.58 [SEQ ID NO: 106 ], rh.61 [SEQ ID NO: 107]; rh.50 [SEQ ID NO: 
108 ]; rh.43 [SEQ ID NO: 163]; rh.62 [SEQ ID NO: 1 14 ]; rh.48 [SEQ ID NO: 1 1 5]; 4- 
20 9/rh.54(SEQIDNo: 116);and4-19/rh.55(SEQIDNos: 117);hu.31 [SEQ ID NO: 121]; 
hu.32 [SEQ ID NO: 122]; hu.34 [SEQ ID NO: 1 25]; hu.45 [SEQ ID NO: 1 27]; hu.47 
[SEQ ID NO: 128]; hu.13 [SEQ ID NO:I29]; hu.28 [SEQ ID NO: 1 30]; hu.29 [SEQ ID 
NO: 132]; hu.l 9 [SEQ ID NO: 1 33]; hu.20 [SEQ ID NO: 1 34]; hu.2 1 [SEQ ID NO: 1 35]; 
hu.23.2 [SEQ ID NO:137]; hu.22 [SEQ ID NO: 138]; hu.27 [SEQ ID NO: 140]; hu.4 
25 [SEQ ID NO: 141]; hu.2 [SEQ ID NO: 143]; hu.l [SEQ ID NO: 144]; hu.3 (SEQ ID 

NO: 145];hu.25 [SEQ ID NO: 146]; hu.l 5 [SEQ ID NO: 147]; hu.l 6 [SEQ ID NO: 148]; 
hu.l 8 [SEQ ID NO: 149]; hu.7 [SEQ ID NO: 150]; hu.l 1 [SEQ ID NO: 1 53]; hu.9 [SEQ 
ID NO: 155];hu.l0[SEQIDNO: 156];hu.48 [SEQ ID NO: 157]; hu.44 [SEQ ID NO: 
1 58]; hu.46 [SEQ ID NO: 1 59]; hu.43 [SEQ ID NO: 1 60]; hu.35 [SEQ ID NO: 1 64]: 
30 hu.24[SEQIDNO: 136]; rh.64 [SEQ IDNO: 99]; hu.41 [SEQ ID NO: 91]; hu.39 [SEQ 
ID NO: 102];hu.67 [SEQ ID NO: I98];hu.66 [SEQ ID NO: 197];hu.51 [SEQ ID NO: 
I90];hu.52 [SEQ IDNO: 191];hu.49 [SEQ IDNO: 189]; hu.56 [SEQ ID NO: 192]; 
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hu.57 [SEQ ID NO: 193]; hu.58 [SEQ ID NO: 194]; hu.63 [SEQ ID NO: 195]; hu.64 
[SEQ ID NO: 196]; hu.60 [SEQ ID NO: 184]; hu.61 [SEQ ID NO: 185]; hu.53 [SEQ ID 
NO: 186]; hu.55 [SEQ ID NO: 1 87]; hu.54 [SEQ ID NO: 1 88]; hu.6 [SEQ ID NO: 84]; 
and rh.56 [SEQ ID NO: 1 52]. These capsid sequences are also reproduced in the 
Sequence Listing, which is incorporated by reference herein. 

Figs. 3A- 3CN are an alignment of the nucleic acid sequences of AAV vpl capsid 
proteins of the invention, with the numbering of the individual sequences reported, and 
previously published AAV5 (SEQ ID NO: 199); AAV3-3 (SEQ ID NO: 200): AAV4-4 
(SEQ ID NO: 201); AAVl (SEQ ID NO: 202); AAV6 (SEQ ID NO: 203): AAV2(SEQ 
ID NO: 21 1); AAV7 (SEQ ID NO: 213) and AAV8 (SEQ ID NO: 214); rh. 25/42-15; 
29.3/bb.l; cy.2; 29.5/bb.2; rh.32, rh.33, rh.34, rh.lO; rh.24; rhl4, rh.l6, rh.l7, rh.l2, 
rh.1 8, rh.21 (formerly termed 41.10); rh.25 (formerly termed 41.15; GenBank accession 
AY530557); rh2; rh.3I; cy.3; cy.5; rh.13; cy.4: cy.6: rh.22: rh.l9: rh,35: rh.37: rh 36: 
rh.23; rh.8; and ch.5 [US Published Patent Application No. 2003/0138772 A I (Jul 24, 
15 2003)]. The nucleic acid sequences of the invention include, hu.l4/AAV9 (SEQ ID No: 
3); LG-4/rh.38 (SEQ ID No: 7); LG-10/rh.40 (SEQ ID No: 14); N721-8/rh.43 (SEQ ID 
No: 43); I -8/rh.49 (SEQ ID NO: 25); 2-4/rh.50 (SEQ ID No: 23); 2-5/rh.5 1 (SEQ ID 
No: 22); 3-9/rh.52 (SEQ ID No: 1 8); 3-1 l/rh.53 (SEQ ID NO: 1 7); 5-3/rh.57 (SEQ ID 
No: 26)'; 5-22/rh.58 (SEQ ID No: 27); 2-3/rh.61 (SEQ ID NO: 21); 4-8/rh.64 (SEQ ID 
20 No: 15);3.l/hu.6(SEQIDNO: 5); 33.12/hu.l7 (SEQ lDNO:4); 106.1/hu.37 (SEQ ID 
No: 10); LG-9/hu.39 (SEQ ID No: 24); 114.3/hu.40 (SEQ ID No: 1 1); 127.2/hu.4l (SEQ 
ID N0:6); 127.5/hu.42 (SEQ ID No: 8); and hu.66 (SEQ ID NO: 173 ); 2-15/ rh.62 (SEQ 
ID NO: 33); l-7/rh.48 (SEQ ID NO: 32); 4.9/rh.54 (SEQ ID No: 40); 4-19/rh.55 (SEQ ID 
NO: 37); 52/hu.l9 (SEQ IDNO: 62), 52.1/hu.20 (SEQ IDNO: 63), 54.5/hu.23 (SEQ ID 
25 No: 60), 54.2/hu.22 (SEQ ID No: 67), 54.7/hu.24 (SEQ ID No: 66), 54. 1 /hu.2 1 (SEQ 1 D 
No: 65), 54.4Ryhu.27 (SEQ ID No: 64); 46.2/hu.28 (SEQ ID No: 68); 46.6/hu.29 (SEQ 
ID No: 69); 128.1/hu.43 (SEQ ID No: 80); 128.3/hu.44 (SEQ ID No: 81) and 
130.4/hu.48 (SEQ IDNO: 78); 3.1/hu.9 (SEQ ID No: 58); 16.8/hu.lO (SEQ ID No: 56); 
16,12/hu.ll (SEQIDNo:57); 145.1/hu.53 (SEQ ID No: 176); 145.6/hu.55 (SEQ ID No: 
30 178); 145.5/hu.54(SEQlDNo: 1 77); 7.3/hu.7 (SEQ ID No: 55); 52/hu. 19 (SEQ ID No: 
62); '33.4/hu.l5 (SEQ ID No: 50); 33.8/hu. 1 6 (SEQ ID No: 51): 58.2/hu.25 (SEQ ID No: 
49)'; 161.l0/hu.60 (SEQ ID No: 170); H-5/hu.3 (SEQ ID No: 44); H-l/hu.l (SEQ ID No: 
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46); 161.6/hu.61 (SEQIDNo: 174);hu.3I (SEQIDNo: 1); hu.32 (SEQ ID No: 2); hu.46 
(SEQ ID NO: 82); hu.34 (SEQ ID NO: 72); hu.47 (SEQ ID NO: 77); hu.63 (SEQ ID NO: 
204); hu.56 (SEQ ID NO: 205); hu.45 (SEQ ID NO: 76); hu.57 (SEQ ID NO: 206); 
hu.35 (SEQ ID NO: 73); hu.58 (SEQ ID NO: 207); hu.5 1 (SEQ ID NO: 208): Iul49 (SFQ 
5 ID NO: 209); hu.52 (SEQ ID NO: 210); hu.13 (SEQ ID NO: 71); hu.64 (SEQ ID NO: 
212); rh.56 (SEQ ID NO: 54); hLL2 (SEQ ID NO: 48); hu.1 8 (SEQ ID NO: 52): hu.4 
(SEQ ID NO: 47); and hu.67 (SEQ ID NO: 215). These sequences are also reproduced in 
the Sequence Listing, which is incorporated by reference herein. 

Figs. 4A - 4D provide an evaluation of gene transfer efficiency of novel primate 

10 AAV-based vectors m vitro and in vivo. AAV vectors were pseudotyped as described 
[Gao et al, Proc Natl Acad Sci VSA,99\\\ 854- 1 1 859 (Sept. 3, 2002)] with capsids of 
AAVs 1, 2, 5, 7, 8 and 6 and ch.5, rh.34, cy.5, rh.20, rh.8 and AAV9. For in viiro study. 
Fig. 4A, 84-32 cells (293 cells expressing E4 of adenovirus serotypes) seeded in a 95 well 
plate were infected with pseudotyped AAVCMVEGFP vectors at an MO I of I x 1 0"* GC 

15 per cell. Relative EGFP transduction efficiency was estimated as percentage of green 
cells using a UV microscope at 48 hours post-infection and shown on the Y axis. For in 
vivo study, the vectors expressing the secreted reporter gene A 1 AT were administered to 
the liver (Fig. 4B), lung (Fig. 4C) and muscle (Fig. 4D) of NCR nude mice (4-6 weeks 
old) at a dose of 1 x 10** GC per animal by intraportal (Fig. 4B), intratracheal (Fig. 4C) 

20 and intramuscular injections (Fig. 4D), respectively. Serum Al AT levels (ng/mL) were 
compared at day 28 post gene transfer and presented on the Y axis. The X axis indicates 
the AAVs analyzed and the clades to which they belong. 



DETAILED DESCRIPTION OF THE INVENTION 

25 In any arsenal of vectors useful in therapy or prophylaxis, a variety of distinct 

vectors capable of carrying a macromolecule to a target cell is desirable, in order to 
permit selection of a vector source for a desired application. To date, one of the concerns 
regarding the use of AAV as vectors was the lack of a variety of different virus sources. 
One way in which the present invention overcomes this problem is by providing 

30 clades of AAV, which are useful for selecting phylogenetically related, or where desired 
for a selected regimen, phylogenetically distinct, AAV and for predicting function. The 
invention further provides novel AAV viruses. 
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The term "substantial homology" or "substantial similarity," when referring to a 
nucleic acid, or fragment thereof, indicates that, when optimally aligned with appropriate 
nucleotide insertions or deletions with another nucleic acid (or its complementary suanJ), 
there is nucleotide sequence identity in at least about 95 to 99% of the aligned sequences. 
5 Preferably, the homology is over full-length sequence, or an open reading frame thereof, 
or another suitable fragment which is at least 15 nucleotides in length. Examples of 
suitable fragments are described herein. 

The terms "sequence identity" "percent sequence identity" or "percent identicaT' 
in the context of nucleic acid sequences refers to the residues in the two sequences which 

10 are the same when aligned for maximum correspondence. The length of sequence 

identity comparison may be over the full-length of the genome, the full-length of a gene 
coding sequence, or a fragment of at least about 500 to 5000 nucleotides, is desired. 
However, identity among smaller fragments, e.g. of at least about nine nucleotides, 
usually at least about 20 to 24 nucleotides, at least about 28 to 32 nucleotides, at least 

15 about 36 or more nucleotides, may also be desired. Similarly, "percent sequence identity" 
may be readily determined for amino acid sequences, over the full-length of a protein, or 
a fragment thereof Suitably, a fragment is at least about 8 amino acids in length, and 
may be up to about 700 amino acids. Examples of suitable fragments are described 
herein. 

20 The term "substantial homology'' or "substantial similarity," when referring lo 

amino acids or fragments thereof, indicates that, when optimally aligned with appropriate 
amino acid insertions or deletions with another amino acid (or its complementary strand), 
there is amino acid sequence identity in at least about 95 to 99% of the aligned sequences. 
Preferably, the homology is over full-length sequence, or a protein thereof, e.g., a cap 

25 protein, a rep protein, or a fragment thereof which is at least 8 amino acids, or more 

desirably, at least 15 amino acids in length. Examples of suitable fragments are described 
herein. 

By the term "highly conserved" is meant at least 80% identity, preferably at least 
90% identity, and more preferably, over 97% identity. Identity is readily determined by 
30 one of skill in the art by resort to algorithms and computer programs known by those of 
skill in the art. 
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Generally, when referring to "identity", "homology", or "similarity" between two 
different adeno-associated viruses, "identity", "homology" or "similarity" is determined 
in reference to "aligned" sequences. "Aligned" sequences or "alignments" refer to 
multiple nucleic acid sequences or protein (amino acids) sequences, often containing 
5 corrections for missing or additional bases or amino acids as compared to a reference 

sequence. In the examples, AAV alignments are performed using the published AAV2 or 
AAVl sequences as a reference point. However, one of skill in the art can readily select 
another AAV sequence as a reference. 

Alignments are performed using any of a variety of publicly or commercially 
1 0 available Multiple Sequence Alignment Programs. Examples of such programs include, 
"Clustal W", "CAP Sequence Assembly", "MAP", and "MEME", which are accessible 
through Web Servers on the internet. Other sources for such programs are known to 
those of skill in the art. Alternatively, Vector NTl utilities are also used. There are also a 
number of algorithms known in the art that can be used to measure nucleotide sequence 
1 5 identity, including those contained in the programs described above. As another example, 
polynucleotide sequences can be compared using Fasta™, a program in GCG Version 
6. 1 . FastaTM provides alignments and percent sequence identity of the regions of the best 
overlap between the query and search sequences. For instance, percent sequence identity 
between nucleic acid sequences can be determined using Fasta^M with its default 
20 parameters (a word size of 6 and the NOPAM factor for the scoring matrix) as provided 
in GCG Version 6.1, herein incorporated by reference. Multiple sequence alignment 
programs are also available for amino acid sequences, e.g., the "Clustal X", "MAP", 
"PIMA", "MSA", "BLOCKMAKER", "MEME", and "Match-Box" programs. 
Generally, any of these programs are used at default settings, although one of skill in the 
25 art can alter these settings as needed. Alternatively, one of skill in the art can utilize 
another algorithm or computer program which provides at least the level of identity or 
alignment as that provided by the referenced algorithms and programs. See. e.g., J. D. 
Thomson et al, Nucl. Acids. Res., "A comprehensive comparison of multiple 
sequence alignments", 27(13):2682-2690 (1999). 



7 



PCT/US2004/028817 

WO 2005/033321 

The tem. "serotype" U a distincUon wi<h respect to an AAV having a capsid 
which is serologically distinct (ron, o^er AAV serotypes. Serolcgtc distinctiveness ,s 
determined on the basis of the lack of cross-reactivity between antibodies to the AAV 

as compared to other AA V . 
5 Cross-reactivity is typically tneasured in a neutraliztng antibody assay. For 

this assay polyclonal serum is generated against a specific AAV in a rabbi, or other 
suitable animal model using the adeno-associated viruses. In this assay, the serum 
generated against a specific AAV is then tested in its ability to neutrali^e either the 
same (homologous) or a heterologous AAV. The dilution that achieves 50% 
,0 neutrali^tion is considered the neutmlizing antibody titer. If for two AAVs the 

nuonem of the heterologous titer divided by the homologous titer is lower than 1 6 m a 
„ciprocal manner, those two vectors are considered as the same serotype. Conversely, 
if the ratio of the heterologous titer over the homologous titer is 16 or more m a 
reciprocal manner the two AAVs are considered distinct serotypes. 
1 5 As defined herein, to form serotype 9, antibodies generated to a 

selected AAV capsid must not be cross-reactive with any of AAV 1, AAV2, AAV3, 
AAV4 AAVS AAV6,AAV7orAAV8. In one embodiment, the present mventton 
provide an AAV capsid of a novel serotype, identified herein, as human AAV 
serotype 9. 

As used throughout this specification and the claims, the terms "comprising'^ and 
"including" are inclusive of other components, elements, integers, steps and the hke. 
Conversely, the tenn "consisting" and its variants are exclusive of other components, 
elements, integers, steps and the like. 

I- Ciades 

In one aspect, the invention provides ciades of AAV. A clade ,s a group of AAV 
which are phylogenetically related to one another as determined using a Neighbor-Jommg 
algorithm by a bootstrap value of at least 75o/o (of at least 1 000 replicates) and a Po.sson 
correction distance measurement of no more than 0.05. based on alignment of the AAV 
vpl amino acid sequence. 
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The Neighbor-Joining algorithm has been described extensively in the literature. 
See eg.,M. Nei and S. Kumar, Molecular Evolution and Phylogemtics (Oxford 
University Press, New York (2000). Computer programs are available that can be used to 
implement this algorithm. For example, the MEGA v2.1 program implements the 
5 modified Nei-Gojobori method. Using these techniques and computer programs, and the 
sequence of an AAV vpl capsid protein, one of skill in the art can readily determme 
whether a selected AAV is contained in one of the clades identified herein, in another 

clade, or is outside these clades. 

While the clades defined herein are based primarily upon naturally occurring 
,0 AAV vpl capsids, the clades are not limited to naturally occurring AAV. The clades can 
encompass non-naturally occurring AAV, including, without limitation, recombmant. 
modified or altered, chimeric, hybrid, synthetic, artificial, etc.. AAV which are 
phylogenetically related as determined using a Neighbor-Joining algorithm at least 75% 
(of at least 1000 replicates) and a Poisson correction distance measuremem of no more 
1 5 than 0.05, based on alignment of the AAV vpl amino acid sequence. 

The clades described herein include Clade A (represented by AAVl and AAV6), 
Clade B (represented by AAV2) and Clade C (represented by the AAV2-AAV3 hybrid), 
Clade D (represented by AAV7), Clade E (represented by AAV8), and Clade F 
(represented by human AAV9). These clades are represented by a member of the clade 
20 that is a previously described AAV serotype. Previously described AAV 1 and AAV6 are 
members of a single clade (Clade A) in which 4 isolates were recovered from 3 humans. 
Previously described AAV3 and AAV5 serotypes are clearly distinct from one another, 
but were not detected in the screen described herein, and have not been included in any of 
these clades. 

25 Clade B (AAV2) and Clade C (the AAV2-AAV3 hybrid) are the most abundant 

of those found in humans (22 isolates from 12 individuals for AAV2 and 17 isolates from 

8 individuals for Clade C). 

There are a large number of sequences grouped in either Clade D (AAV7) or 
Clade E (AAV8). Interestingly, both of these clades are prevalent in differem species. 
30 Clade D is unique to rhesus and cynomologus macaques with 1 5 members being isolated 
from 10 different animals. Clade E is interesting because it is found in both human and 
nonhuman primates: 9 isolates were recovered from 7 humans and 21 isolates were 
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obtained in 9 different nonhuman primates including rliesus macaques, a baboon and a 
pigtail moni<ey. 

In two other animals the hybrid nature of certain sequences was proven, although 
all sequences in this case seem to have originated through individual and different 
5 recombinations of two co-infecting viruses (in both animals a Clade D with a Clade E 
virus). None of these recombinants were identified in other animals or subjects. 

Since Clade C (the AAV2-AAV3 hybrid) clade was identified in 6 different 
human subjects, the recombination event resulted in a fit progeny. In the case of the 
AAV7-AAV8 hybrids on the other hand, only few conclusions can be drawn as to the 
10 implication of recombination in AAV evolution. These recombination events show that 
AAV is capable of recombining, thereby creating in-frame genes and in some cases 
packagable and/or infectious capsid structures. Clade C (the AAV2-AAV3 hybrid clade) 
on the other hand is a group of viruses that has acquired a selective advantage through 
recombination that made them sustain certain environmental pressures. 
1 5 A. Clade A (represented by AAV1 and AAV6): 

In another aspect, the invention provides Clade A, which is characterized 
by containing the previously published AAVl and AAV6. See, e.g.. International 
Publication No. WO 00/28061, 18 May 2000; Rutledge et a\,J Virol, 72(1):309-319 (Jan 
1 998). In addition, this clade contains novel AAV including, without limitation, 
20 128.1/hu. 43 [SEQ ID NOs: 80 and 160]; 128.3/hu. 44 [SEQ ID Nos: 81 and 158]; 
1 30.4/hu.48 [SEQ ID NO: 78 and 1 57]; and hu.46 [SEQ ID NOs: 82 and 1 59]. The 
invention further provides a modified hu. 43 capsid [SEQ ID NO:236] and a modified hu. 

46 capsid [SEQ ID NO:224]. 

In one embodiment, one or more of the members of this clade has a capsid 
25 with an amino acid identity of at least 85% identity, at least 90% identity, at least 95% 

identity, or at least 97% identity over the full-length of the vpl , the vp2, or the vp3 of the 
AAV 1 and/or AA V6 capsid. 

In another embodiment, the invention provides novel AAV of Clade A, 
provided that none of the novel AAV comprises a capsid of any of AAVl or AAV6. 
30 These AAV may include, without limitation, an AAV having a capsid derived from one 
or more of 128.1/hu. 43 [SEQ ID Nos: 80 and 160]; modified hu.43 [SEQ JD NO:236] 
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128.3/hu. 44 [SEQ IDNos: 81 and 158]; hu.46 [SEQ IDNOs: 82 and 159]; modified hu. 
46 [SEQ IDNO:224]; and 130.4/hu.48 [SEQ ID NO: 78 and 157]. 
B. Clade B (AAV2 Clade): 

In another embodiment, the invention provides a Clade B. 
5 This clade is characterized by containing, at a minimum, the previously 

described AAV2 and novel AAV of the invention including, 52/hu.l9 [SEQ ID NOs: 62 
and 133], 52.1/hu.20 [SEQ IDNOs: 63 and 134], 54.5/hu.23 [SEQ ID Nos: 60 and 137], 
54.2/hu.22 [SEQ ID Nos: 67 and 138], 54.7/hu.24 [SEQ ID Nos: 66 and 136], 54.1/hu.21 
[SEQ ID Nos: 65 and 135], 54.4R/hu.27 [SEQ ID Nos: 64 and 140]; 46.2/hu.28 [SEQ ID 
10 Nos: 68 and 130]; 46.6/hu.29 [SEQ IDNos: 69 and 132]; modified hu. 29 [SEQ ID NO: 
225]; 172.1/hu.63 [SEQ ID NO: 171 and 195; GenBank Accession No. AY530624]; 
172.2/hu. 64 [SEQ ID NO: 172 and 196; GenBank Accession No. AY530625]; 
24.5/hu.l3 [SEQ ID NO: 71 and 129; GenBank Accession No. AY530578]; 145.6/hu.56 
[SEQ ID NO: 168 and 192]; hu.57 [SEQ IDNos: 169 and 193]; 136.1/hu.49 [SEQ ID 
15 NO: 165 and 189]; 156.1/hu.58 [SEQ ID NO: 179 and 194]; 72.2/hu.34 [SEQ ID NO: 72 
and 125; GenBank Accession No. AY530598]; 72.3/hu.35 [SEQ ID NO: 73 and 164; 
GenBank Accession No. AY530599]; 130.1/hu.47 [SEQ ID NO: 77 and 128]; 
129.1/hu.45 (SEQ ID NO: 76 and 127; GenBank Accession No. AY530608); 140.l/hu.5l 
[SEQ ID NO: 161 and 190; GenBank Accession No. AY530613]; and 140.2/hu.52 [SEQ 
20 ID NO: 167 and 191; GenBank Accession No. AY530614]. 

In one embodiment, one or more of the members of this clade has a capsid with 
an amino acid identity of at least 85% identity, at least 90% identity, at least 95% identity, 
or at least 97% identity over the full-length of the vpl, the vp2, or the vp3 of the AAV2 
capsid. 

25 In another embodiment, the invention provides novel AAV of Clade B, provided 

that none of the AAV has an AAV2 capsid. These AAV may include, without limitation, 
an AAV having a capsid derived from one or more of the following: 52/hu. 1 9 [SEQ ID 
NOs: 62 and 133], 52.1/hu.20 [SEQ IDNOs: 63 and 134], 54.5/hu.23 [SEQ IDNos: 60 
and 137], 54.2/hu.22 [SEQ ID Nos: 67 and 138], 54.7/hu.24 [SEQ ID Nos: 66 and 136], 

30 54.1/hu.21 [SEQ IDNos: 65 and 135], 54.4R/hu.27 [SEQ IDNos: 64 and 140]; 

46.2/hu.28 [SEQ ID Nos: 68 and 130]; 46.6/hu.29 [SEQ ID Nos: 69 and 132]; modified 
hu. 29 [SEQ ID NO: 225]; 172.1/hu.63 [SEQ ID NO: 171 and 195 ]; 172.2/hu. 64 [SEQ 
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,DNO- 172 a«i 196]; 24.5/hu.B tSEQ ID NO: 71 and 129); 145.6/hu.56 [SEQ ID NO: 
,68 and 192; Oe.Bank Accession No. AY530618): hu.57 (SEQ ID Nos: 169 and 193; 
Q=„Bank Accession No. AY5306191; 136.1/hu.49 [SEQ ID NO: 165 and 189; OenBank 
Accession NO. AY5306121; ,56.1/hu.58[SEQlDNO: 179 and 194; OenBank Accession 
5 NO AY530620); 72.2/hu.34 [SEQ ID NO: 72 and 125); 72.3/h>,.35 (SEQ ID NO: 73 and 
,64]- I29.1/hu.45 [SEQ ID NO: 76and 127); 130.,/hu.47 [SEQ 1DN0:77 and 128; 
OenBank Accession NO. AY5306.01; 140.1/hu.51 [SEQIDNO: 161 and 190; OenBank 
AccessionNo.AY530613);a„d l40.2/h..52 [SEQ ID NO: 167 and 191;0enBank 

Accession No. AY5306141. 

C Clade C (AAV2-AAV3 Hybrid Clade) 

In another aspect, the invention provides Clade C, which is characterized 
by containing AAV that are hybrids of the previously published AA V2 and AA V3 such 
as H-6/hu 4; H-2/hu.2 [US Patent Application 2003/0138772 (Jun 24. 2003). In addition, 
this Clade contains novel AAV inclnding, without lin,ita.ion, 3.1/hu.9 (SEQ ID Nos: 58 
,5 and 155]; 16.8/hu.lO [SEQ ID Nos: 56 and 156); 16.12/hu.l , [SEQ ID Nos: 57 and 153); 
145 1/hu 53 [SEQ ID Nos: 176 and 186); 145.6/hu.55 [SEQ ID Nos: 178 and 187J; 
145 5/hu 54 [SEQ ID Nos: 177 and 188]; 7.3/hu.7 [SEQ ID Nos: 55 and 1 50; now 
deposited as OenBank Accession No. AY5306281; modified hu. 7 [SEQ ID NO: 226); 
33 4/hu 15 [SEQ ID Nos: 50 and 147); 33.8/hu.l6 [SEQ ID Nos: 51 and 148); hu.18 
20 ISEQlDNOs: 52and 149); 58.2*u.25 [SEQ ID Nos: 49 and 146); 161.10/hu.60 [SEQ 
ID Nos: 170 and 184); H-5/hu.3 [SEQ IDNos: 44 and 145); H-l/hu.l [SEQ,DNos:46 
and 144]; and 16l.6/hu.6l [SEQ IDNos: 174 and 185], 

In one embodiment, one or mote of the members of this clade has a caps.d 
with an amino acid identity of at least 85% identity, a. least 90% identity, at least 95% 
25 identity, or at least 97% identity over the full-length of the vpl , the vp2, or the vp3 of the 

hu.4 and/or hu.2 capsid. 

m another embodiment, the invention provides novel AAV of Clade C 
(,he AAV2-AAV3 hybrid clade), provided that none of the novel AAV comprises a 
capsid of hu.2 or hu.4. These AAV may include, without limitation, an AAV havmg a 
,0 capsid derived from one or more of3.l/hu.9 [SEQ IDNos; 58 and 155]; 16.8*u.l0 [SEQ 
IDNOS- 56 and 156); 16.12/hu.l, (SEQ ID Nos: 57 and 153]; ,45.,/hu.53 [SEQ ID Nos: 
176 and 186]; 145.6/hu.55 [SEQ ID Nos: 178 and 187]; ,45.5/hu.54 [SEQ ID Nos: 177 
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and 188]; 7.3/hu.7 [SEQ ID Nos: 55 and 150]; modified hu.7 [SEQ ID NO:226]; 
33,4/hu.l5 [SEQ ID Nos: 50 and 147]; 33.8/hu.l6 [SEQ ID Nos: 51 and 148]; 58.2/hu.25 
[SEQ ID Nos: 49 and 146]; 161.IO/hu.60 [SEQ ID Nos: 170 and 1 84]; H-5/hu.3 (SEQ ID 
Nos: 44 and 145]; H-l/hu.l [SEQ ID Nos: 46 and 144]; and 161.6/hu.6l [SEQ ID Nos: 
5 1 74 and 185]. 

D. Clade D (AAV7 clade) 

In another embodiment, the invention provides Clade D. This clade is 
characterized by containing the previously described AAV7 [G. Gao et al, Proc. Nad 
Acad. Sci USA, 99:1 1854-9 (Sep. 3, 2002). The nucleic acid sequences encoding the 

10 AAV7 capsid are reproduced in SEQ ID NO: 184; the amino acid sequences of the AAV7 
capsid are reproduced in SEQ ID NO: 185. In addition, the clade contains a number of 
previously described AAV sequences, including: cy.2; cy.3; cy.4; cy.5; cy.6; rh.l3; rh.37; 
rh, 36; and rh.35 [US Published Patent Application No. US 2003/0138772 Al (July 24 
2003)]. Additionally, the AAV7 clade contains novel AAV sequences, including, without 

15 limitation, 2-15/ rh.62 [SEQ ID Nos: 33 and 114]; l-7/rh.48 [SEQ ID Nos: 32 and 115]; 
4-9/rh.54[SEQIDNos:40and I 16]; and 4-1 9/rh.55 [SEQ ID Nos: 37 and 117]. The 
invention further includes modified cy. 5 [SEQ ID NO: 227]; modified rh.l3 [SEQ ID 
NO: 228]; and modified rh. 37 [SEQ ID NO: 229]. 

In one embodiment, one or more of the members of this clade has a capsid 

20 with an amino acid identity of at least 85% identity, at least 90% identity, at least 95% 

identity, or at least 97% identity over the full-length of the vpl , the vp2, or the vp3 of the 
AAV7 capsid, SEQ ID NO: 1 84 and 185. 

In another embodiment, the invention provides novel AAV of Clade D, 
provided that none of the novel AAV comprises a capsid of any of cy.2; cy.3; cy,4; cy.5; 

25 cy,6; rh.l3; rh.37; rh. 36; and rh.35. These AAV may include, without limitation, an 
AAV having a capsid derived from one or more of the following 2-15/ rh.62 [SEQ ID 
Nos: 33 and 1 14]; l-7/rh.48 [SEQ ID Nos: 32 and 115]; 4-9/rh,54 [SEQ ID Nos: 40 and 
116]; and 4.19/rh.55 [SEQ ID Nos: 37 and 117]. 

E. Clade E (AAV8 clade) 

30 In one aspect, the invention provides Clade E. This clade is characterized 

by containing the previously described AAV8 [G. Gao et al, Proc. Natl Acad. Sci USA. 
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99:1 1854-9 (Sep. 3, 2002)], 43.1/rh.2; 44.2/rh.lO; rh. 25; 29.3/bb.l ; and 29.5/bb.2 [US 
Published Patent Application No. US 2003/0138772 A 1 (Jul 24 2003)]. 

Further, the clade novel AAV sequences, including, without limitation, 
including, e.g., 30.10/pi.l [SEQ ID NOs: 28 and 93], 30.12/pi.2 [SEQ ID NOs: 30 and 95, 
5 30.19/pi.3 [SEQ ID NOs: 29 and 94], LG-4/rh.38 [SEQ ID Nos: 7 and 86]; LG-iO/rh.40 
[SEQ ID Nos: 14 and 92]; N721-8/rh.43 [SEQ ID Nos: 43 and 163];l-8/rh.49 [SEQ ID 
NOs: 25 and 103]; 2-4/rh.50 [SEQ ID Nos: 23 and 108]; 2-5/rh.51 [SEQ ID Nos: 22 anci 
104]; 3-9/rh.52 [SEQ ID Nos: 18 and 96]; 3-1 1/rh.53 [SEQ ID NOs: 17 and 97]: 5-3/i-h.57 
[SEQ ID Nos: 26 and 105]; 5-22/rh.58 [SEQ ID Nos: 27 and 58]; 2-3/ih.61 [SEQ ID 

10 NOs: 21 and 107]; 4-8/rh.64 [SEQ ID Nos: 15 and 99]; 3.I/hu.6 [SEQ ID NO: 5 and 
84]; 33.12/hu.l7 [SEQ IDNO:4 and 83]; 106.l/hu.37 [SEQ ID Nos: 10 and 88]; LG- 
9/hu.39 [SEQ ID Nos: 24 and 102]; 1 14.3/hu. 40 [SEQ ID Nos: 1 I and 87]; 127.2/lui,4l 
[SEQ ID NO:6 and 91]; 127.5/hu.42 [SEQ ID Nos: 8 and 85]; hu. 66 [SEQ ID NOs: 1 73 
and 197]; and hu.67 [SEQ ID NOs: 174 and 198]. This clade further includes modilled 

15 rh. 2 [SEQ ID NO: 231]; modified rh. 58 [SEQ ID NO: 232]; modified rh. 64 [SEQ ID 
NO: 233]. 

In one embodiment, one or more of the members of this clade has a capsid 
with an amino acid identity of at least 85% identity, at least 90% identity, at least 95% 
identity, or at least 97% identity over the full-length of the vpl , the vp2, or the vp3 of the 
20 AAV8 capsid. The nucleic acid sequences encoding the AAV8 capsid are reproduced in 
SEQ ID NO: 1 86 and the amino acid sequences of the capsid are reproduced in SEQ ID 
NO: 187. 

In another embodiment, the invention provides novel AAV of Clade E, 
provided. that none of the novel AAV comprises a capsid of any of AAV8. rh.8: 

25 44.2/rh.lO; rh. 25; 29.3/bb.I; and 29.5/bb.2 [US Published Patent Application No. US 
2003/0138772 AI (Jul 24 2003)]. These AAV may include, without limitation, an AAV 
having a capsid derived from one or more of the following: 30.10/pi.l [SEQ ID NOs:28 
and 93], 30.12/pi.2 [SEQ IDNOs:30and 95, 30.19/pi.3 [SEQ ID NOs:29 and 94]. l.G- 
4/rh.38 [SEQ ID Nos: 7 and 86]; LG-10/rh.40 [SEQ ID Nos: 14 and 92]; N721-8/rh.43 

30 [SEQ ID Nos: 43 and 1 63]; 1 -8/rh.49 [SEQ ID NOs: 25 and 103J; 2-4/rh.50 [SHQ ID 

Nos: 23 and 108]; 2-5/rh.51 [SEQ ID Nos: 22 and 104]; 3-9/rh.52 [SEQ ID Nos: 1 8 and 
96]; 3-1 l/rh.53 [SEQ ID NOs: 17 and 97]; 5-3/rh.57 [SEQ ID Nos: 26 and 105]; 5- 
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22/rh.58 [SEQ IDNos: 27 and 58]; modified rh. 58 [SEQ ID NO: 232]: 2-3/rh.61 fSEO 
IDNOs: 21 and 107]; 4-8/rh.64 [SEQ ID Nos: 15 and 99]; modified rh. 64[SbQ ID NO: 
233]; 3.1/hu.6 [SEQ ID NO: 5 and 84]; 33,I2/hu.l 7 [SEQ ID NO:4 and 83]; l06.i/luK37 
[SEQ ID Nos: 10 and 88]; LG-9/hu.39 [SEQ ID Nos: 24 and 102]; I 14.3/hu. 40 [SEQ ID 
5 Nos: 11 and 87]; 127,2/hu.41 [SEQ ID NO:6 and 91]; 127.5/hu.42 [SEQ ID Nos: Sand 
85]; hu. 66 [SEQ ID NOs: 1 73 and 1 97]; and hu.67 [SEQ ID NOs: 1 74 and I 98]. 
F. Glade F (AAV 9 Glade) 

This clade is identified by the name of a novel AAV serotype identified 
herein as hu.]4/AAV9 [SEQ ID Nos: 3 and 123]. In addition, this clade contains other 
10 novel sequences including, hu.31 [SEQ ID NOs:l and 121]; and hu.32 [SEQ ID Nos: 2 
and 122]. 

In one embodiment, one or more of the members of this clade has a capsid 
with an amino acid identity of at least 85% identity, at least 90% ideniity. at least 95% 
identity, or at least 97% identity over the full-length of the vp 1 , the vp2. or the vp3 of ihc 
1 5 AA V9 capsid, SEQ ID NO: 3 and 1 23. 

In another embodiment, the invention provides novel AAV of Clade F. 
which include, without limitation, an AAV having a capsid derived from one or more of 
hu.l4/AAV9 [SEQ IDNos: 3 and 123], hu.31 [SEQ IDNOs:l and 121] and hu.32 [SRQ 
ID Nos: I and 122]. 

20 The AAV clades of the invention are useful for a variety of purposes, including 

providing ready collections of related AAV for generating viral vectors, and for 
generating targeting molecules. These clades may also be used as tools for a variety of 
purposes that will be readily apparent to one of skill in the art. 

25 II. NOVEL AAV SEQUENGES 

The invention provides the nucleic acid sequences and amino acids of a novel 
AAV serotype, which is termed interchangeably herein as clone hu. 14/28.4 and huAAV9. 
These sequences are useful for constructing vectors that are highly efficient in 
transduction of liver, muscle and lung. This novel AAV and its sequences are also useful 
30 for a variety of other purposes. These sequences are being submitted with GenBank and 
have been assigned the accession numbers identified herein. 
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The invention further provides the nucleic acid sequences and amino acid 
sequences of a number of novel AAV. Many of these sequence include those described 
above as members of a clade, as summarized below. 

128.1/hu. 43 [SEQ ID Nos: 80 and 160 GenBank Accession No. 
5 AY530606]; modified hu. 43 [SEQ ID NO:236]; i28.3/lui. 44 [SEQ ID Nos: 8 1 and 158; 
GenBank Accession No. AY530607] and 130.4/hu.48 [SEQ ID NO: 78 and 157; 
GenBank Accession No. AY53061 1]; from the Clade A; 

52/hu.l9 [SEQ ID NOs: 62 and 133; GenBank Accession No. 
AY530584], 52.1/hu.20 [SEQ ID NOs: 63 and 1 34; GenBank Accession No. AY530586], 
10 54.5/hu.23 [SEQ ID Nos: 60 and 137; GenBank Accession No. AY530589], 54.2/hu.22 
[SEQ ID Nos: 67 and 138; GenBank Accession No. AY530588], 54.7/hu.24 [SEQ ID 
Nos: 66 and 136; GenBank Accession No. AY530590], 54.1/hu.21 [SEQ ID Nos: 65 and 
135; GenBank Accession No. AY530587], 54.4R/hu.27 [SEQ ID Nos: 64 and 140; 
GenBank Accession No. AY530592]; 46.2/hu.28 [SEQ ID Nos: 68 and 1 30; GenBank 
1 5 Accession No. AY530593]; 46.6/hu.29 [SEQ ID Nos: 69 and 1 32; GenBank Accession 
No. AY530594]; modified hu. 29 [SEQ ID NO: 225]; I72.l/hu.63 [SEQ ID NO: 171 and 
195]; and 140.2/hu.52 (SEQ ID NO: 167 and 191 ; from Clade B; 

3.1/hu.9 [SEQ ID Nos: 58 and 155; GenBank Accession No. AY530626]; 
l6.8/hu.lO [SEQ ID Nos: 56 and 156; GenBank Accession No. AY530576]; I6.12/hu.l 1 
20 [SEQ ID Nos: 57 and 153; GenBank Accession No. AY530577]; 145.1/hu.53 [SEQ ID 
Nos: 176 and 1 86; GenBank Accession No. A Y5306 15]; 145.6/hu.55 [SEQ ID Nos: 178 
and 187; GenBank Accession No. AY530617]; 145.5/hu.54 [SEQ ID Nos: 177 and 188; 
GenBank Accession No. AY530616]; 7.3/hu.7 [SEQ ID Nos: 55 and 150; GenBank 
Accession No. AY530628]; modified hu. 7 [SEQ ID NO: 226]; hu.l 8 [SEQ ID Nos: 52 
25 and 149; GenBank Accession No. AY5305831; 33.4/hu.l 5 [SEQ ID Nos: 50 and 147; 
GenBank Accession No. AY530580]; 33.8/hu.l6 [SEQ ID Nos: 51 and 148; GenBank 
Accession No. AY530581]; 58.2/hu.25 [SEQ ID Nos: 49 and 146: GenBank Accession 
No. AY530591]; 161.10/hu.60 [SEQ ID Nos: 170 and 184; GenBank Accession No. 
AY530622]; H-5/hu.3 [SEQ ID Nos: 44 and 145; GenBank Accession No. AY530595]; 
30 H-l/hu.l [SEQ ID Nos: 46 and 144; GenBank Accession No. AY530575]; and 

I61.6/hu.61 [SEQ ID Nos: 174 and 185; GenBank Accession No. A Y530623] from Clade 

C; 
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2-15/ rh.62 [SEQ IDNos: 33 and 1 14; GenBank Accession No. 
AY530573]; l-7/rh.48 [SEQ ID Nos: 32 and 115; GenBank Accession No. AY530561]: 
4-9/rh.54 [SEQ ID Nos: 40 and 1 16; GenBank Accession No. AY530567]; and 4- 
19/rh.55 [SEQ ID Nos: 37 and 1 17; GenBank Accession No. AY530568]; modified cy. 5 
5 [SEQ ID NO: 227]; modified rh. 1 3 [SEQ ID NO: 228]; and modified rh. 37 [SEQ I D NO: 
229] from the Clade D; 

30.10/pi.l [SEQ lDNOs:28 and 93; GenBank Accession No. AY53055], 
30.12/pi.2 [SEQ ID NOs:30 and 95; GenBank Accession No. AY 530554], 30. 19/pi.3 
[SEQ ID NOs:29 and 94; GenBank Accession No. AY530555], LG-4/rh.38 [SEQ ID 

10 Nos: 7 and 86; GenBank Accession No. AY 530558]; LG- 1 0/rh.40 [SEQ ID Nos: 1 4 and 
92; GenBank Accession No. AY530559]; N721-8/rh.43 (SEQ ID Nos: 43 and 163; 
GenBank Accession No. AY530560];l-8/rh.49 [SEQ ID NOs: 25 and 1 03; GenBank 
Accession No. AY530561]; 2-4/rh.50 [SEQ ID Nos: 23 and 108; GenBank Accession 
No. AY530563]; 2-5/rh.51 [SEQ ID Nos: 22 and 104; GenBank Accession No. 530564]; 

15 3-9/rh.52 [SEQ ID Nos: 18 and 96; GenBank Accession No. AY530565]; 3-1 l/rh.53 

[SEQ IDNos: 17 and 97;GenBank Accession No. AY530566]; 5-3/rh.57 [SEQ ID Nos: 
26 and 105; GenBank Accession No. AY530569]; 5-22/rh.58 [SEQ ID Nos: 27 and 58; 
GenBank Accession No. 530570]; modified rh. 58 [SEQ ID NO: 232]; 2-3/rh.6l [SEQ ID 
Nos: 21 and 107; GenBank Accession No. AY530572]; 4-8/rh.64 [SEQ ID Nos: 15 and 

20 99; GenBank Accession No. AY530574]; modified rh. 64[SEQ ID NO: 233]; 3. i/hu.6 
[SEQ ID NO: 5 and 84; GenBank Accession No. AY530621]; 33.12/hu.l7 [SEQ ID 
NO:4 and 83; GenBank Accession No. AY530582]; 106.1/hu.37 [SEQ ID Nos: 10 and 
88; GenBank Accession No. AY530600]; LG-9/hu.39 [SEQ ID Nos: 24 and 102; 
GenBank Accession No. AY530601]; 1 14.3/hu. 40 [SEQ ID Nos: 1 1 and 87; GenBank 

25 Accession No. AY530603]; 127.2/hu.41 [SEQ ID N0:6 and 91; GenBank Accession No. 
AY530604]; 127.5/hu.42 [SEQ ID Nos: 8 and 85; GenBank Accession No. AY530605]; 
and hu. 66 [SEQ ID NOs: 173 and 197; GenBank Accession No. AY530626]; and lui.67 
[SEQ ID NOs: 1 74 and 1 98; GenBank Accession No. AY530627]; and modified rh.2 
[SEQ ID NO:23 1 ]; from Clade E; 

30 hu.l4/AAV9 [SEQ ID Nos: 3 and 123; GenBank Accession No. 

AY530579], hu.31 [SEQIDNOs:! and 121 ; AY530596] and hu.32 [SEQ IDNos: I and 
122; GenBank Accession No. AY5305971 from Clade F. 
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In addition, the present invention provides AAV sequences, including, 
rh.59 [SEQ ID NO: 49 and 110];rh.60 [SEQ ID NO: 31 and 120; GenBank Accession 
No. AY530571], modified ch.5 [SEQ ID NO: 234]; and modified rh. 8 [SEQ ID NO: 
235], which are outside the definition of the ciades described above. 
5 ' Also provided are fragments ofthe AAV sequences of the invention. 

Each of these fragments may be readily utilized in a variety of vector systems and host 
cells. Among desirable AAV fragments are the cap proteins, including the vp 1 , vp2, vp3 
and hypervariable regions. Where desired, the methodology described in published US 
Patent Publication No. US 2003/0138772 Al (July 24 2003)] can be used to obtain the 
10 rep sequences for the AAV clones identified above. Such rep sequences include, e.g., rep 
78. rep 68, rep 52, and rep 40, and the sequences encoding these proteins. Similarly, 
other fragments of these clones may be obtained using the techniques described in the 
referenced patent publication, including the AAV inverted terminal repeat (ITRs), AAV 
P19 sequences, AAV P40 sequences, the rep binding site, and the terminal resolute site 
15 (TRS). Still other suitable fragments will be readily apparent to those of skill in the art. 

The capsid and other fragments ofthe invention can be readily utilized in 
a variety of vector systems and host cells. Such fragments may be used alone, in 
combination with other AAV sequences or fragments, or in combination with elements 
from other AAV or non-AAV viral sequences. In one particularly desirable embodiment, 
20 a vector contains the AAV cap and/or rep sequences ofthe invention. 

The AAV sequences and fragments thereof are useful in production of 
rAAV, and are also useful as antisense delivery vectors, gene therapy vectors, or vaccine 
vectors. The invention further provides nucleic acid molecules, gene delivery vectors, 
and host cells which contain the AAV sequences of the invention. 

Suitable fragments can be determined using the information provided 



25 



30 



herein. 

As described herein, the vectors ofthe invention containing the AAV 
capsid proteins ofthe invention are particularly well suited for use in applications in 
which the neutralizing antibodies diminish the effectiveness of other AAV serotype based 
vectors, as well as other viral vectors. The rAAV vectors of the invention are particularly 
advantageous in rAAV readministration and repeat gene therapy. 



18 



wo 2005/033321 



PCT/US2004/028817 



These and other embodiments and advantages of the invention are 
described in more detail below. 

A. AAV Serotype 9/hu14 Sequences 

The invention provides the nucleic acid sequences and amino acids of a 
5 novel AAV, which is termed interchangeable herein as clone hu.l4 (formerly termed 
28.4) and huAAV9. As defined herein, novel serotype AAV9 refers to AAV having a 
capsid which generates antibodies which cross-react serologically with the capsid having 
the sequence of hu. 14 [SEQ ID NO: 123] and which antibodies do not cross-react 
serologically with antibodies generated to the capsids of any of AAVl, AAV2, AAV3, 
10 AAV4, AAV5, AAV6, AAV7 or AAV8. 

1 . Nucleic Acid Sequences 

The AAV9 nucleic acid sequences of the invention include the 
DNA sequences of SEQ ID NO: 3, which consists of 22 11 nucleotides. 

The nucleic acid sequences of the invention further encompass the 
1 5 strand which is complementary to SEQ ID NO: 3, as well as the RNA and cDNA 

sequences corresponding to SEQ ID NO: 3, and its complementary strand. Also included 
in the nucleic acid sequences of the invention are natural variants and engineered 
modifications of SEQ ID NO: 3 and its complementary strand. Such modifications 
include, for example, labels that are known in the art, methylation, and substitution of one 
20 or more of the naturally occurring nucleotides with a degenerate nucleotide. 

Further included in this invention are nucleic acid sequences 
which are greater than about 90%, more preferably at least about 95%, and most 
preferably at least about 98 to 99%, identical or homologous to SEQ ID NO: 3. 

Also included within the invention are fragments of SEQ ID NO: 
25 3, its complementary strand, and cDNA and RNA complementary thereto. Suitable 

fragments are at least 15 nucleotides in length, and encompass functional fragments, i.e.. 
fragments which are of biological interest. Such fragments include the sequences 
encoding the three variable proteins (vp) of the AAV9/HU.14 capsid which are alternative 
splice variants: vpl [nt 1 to 221 1 of SEQ ID NO:3]; vp2 [about nt 41 1 to 221 1 of SEQ ID 
30 N0:3]; and vp 3 [about nt 609 to 221 1 of SEQ ID NO:3]. Other suitable fragments of 
SEQ ID NO: 3, include the fragment which contains the start codon for the AA V9/HU. 1 4 
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capsid protein, and the fragments encoding the hypervariable regions of the vpl capsid 
protein, which are described herein. 

In addition to including the nucleic acid sequences provided in the 
figures and Sequence Listing, the present invention includes nucleic acid molecules and 
5 sequences which are designed to express the amino acid sequences, proteins and peptides 
of the AAV serotypes of the invention. Thus, the invention includes nucleic acid 
sequences which encode the following novel AAV amino acid sequences and artificial 
AAV serotypes generated using these sequences and/or unique fragments thereof. 

As used herein, artificial AAV serotypes include, without 
10 limitation, AAVs with a non-naturally occurring capsid protein. Such an artificial capsid 
may be generated by any suitable technique, using a novel AAV sequence of the 
invention (e.g., a fragment of a vpl capsid protein) in combination with heterologous 
sequences which may be obtained from another AAV serotype (known or novel), non- 
contiguous portions of the same AAV serotype, from a non-AA V viral source, or from a 
15 non-viral source. An artificial AAV serotype may be, without limitation, a chimeric 
AAV capsid, a recombinant AAV capsid, or a "humanized" AAV capsid. 

2. HU.14/AAV9 Amino Acid Sequences, Proteins and 

Peptides 

The invention further provides proteins and fragments thereof 
20 which are encoded by the hu.l4/AAV9 nucleic acids of the invention, and hu.l4/AAV9 
proteins and fragments which are generated by other methods. As used herein, these 
proteins include the assembled capsid. The invention further encompasses AAV 
serotypes generated using sequences of the novel AAV serotype of the invention, which 
are generated using synthetic, recombinant or other techniques known to those of skill in 
25 the art. The invention is not limited to novel AAV amino acid sequences, peptides and 
proteins expressed from the novel AAV nucleic acid sequences of the invention, but 
encompasses amino acid sequences, peptides and proteins generated by other methods 
known in the art, including, e.g., by chemical synthesis, by other synthetic techniques, or 
by other methods. The sequences of any of the AAV capsids provided herein can be 
30 readily generated using a variety of techniques. 

Suitable production techniques are well known to those of skill in 
the art. See, e.g., Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold 
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Spring Harbor Press (Cold Spring Harbor, NY). Alternatively, peptides can also be 
synthesized by the well-known solid phase peptide synthesis methods (Merrifield, ./. Am. 
Chem. Soc, 85:2149 (1962); Stewart and Young, Solid Phase Peptide Synthesis 
(Freeman. San Francisco, 1969) pp. 27-62). These and other suitable production methods 
5 are within the knowledge of those of skill in the art and are not a limitation of the present 
invention. 

Particularly desirable proteins include the AAV capsid proieins. 
which are encoded by the nucleotide sequences identified above. The AAV capsid is 
composed of three proteins, vpl, vp2 and vp3, which are alternative splice variants. The 
10 full-length sequence provided in Fig. 2 is that of vpl. The AAV9/HU.14 capsid prote.ns 
include vpl [amino acids (aa) I to 736 of SEQ ID NO: 123 ], vp2 [about aa 1 38 to 736 ot 
SEQIDNO: 123], vp3 [about aa 203 to 736 of SEQ ID NO: 123], and functional 
fragments thereof. Other desirable fragments of the capsid protein include the constant 
and variable regions, located between hypervariable regions (HVR). Other desirable 
15 fragments of the capsid protein include the HVR themselves. 

An algorithm developed to determine areas of sequence 
divergence in AAV2 has yielded 12 hypervariable regions (HVR) of which 5 overlap or 
are part of the four previously described variable regions. [Chiorini el al, J. Virol, 
731309-19 (1999); Rutledge et al,J. Virol., 72:309-319] Using this algorithm and/or the 
20 alignment techniques described herein, the HVR of the novel AAV serotypes are 

determined. For example, the HVR are located as follows: HVRl, aa 146-152; HVR2, 
aa 1 82-186; HVR3, aa 262-264; HVR4, aa 381-383; HVR5, aa 450-474; HVR6, aa 490- 
495; HVR7, aa 500-504; HVR8, aa 514-522; HVR9, aa 534-555; HVRIO, aa 58 1-594; 
HVRl I, aa 658-667; and HVR12, aa 705-719 [the numbering system is based on an 
25 alignment which uses the AAV2 vpl as a point of reference]. Using the alignment 

provided herein performed using the Clustal X program at default settings, or using other 
commercially or publicly available alignment programs at default settings such as are 
described herein, one of skill in the art can readily determine corresponding fragments of 
the novel AAV capsids of the invention. 

Still other desirable fragments of the AAV9/HU. 14 capsid protein 
include amino acids 1 to 184 of SEQ ID NO: 123, amino acids 199 to 259; amino acids 
274 to 446; amino acids 603 to 659; amino acids 670 to 706; amino acids 724 to 736 of 
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SEQ ID NO- 123; aa 1 85 - 198; aa 260-273; aa447-477; aa495-602; aa660-669; and 
aa707-723 Additionally, examples of other suitable fragments of AAV capsids .nclode, 
with respect to the numbering of AAV9 [SEQ ID NO: 123], aa 24 - 42, aa 25 - 28; aa 8 1 
-85- aal33-l65;aa 134- i65;aa 137-143; aa ,54-156; aa ,94-208; aa 261-274: aa 262- 
5 274'aa 171-173; aa 413-417; aa 449-478; aa 494-525; aa 534-571; aa 581-601; aa 660- 
671- aa 709-723. Using the alignment provided herein performed using the Custa, X 
program at default settings, or using other commercially or publicly available alignment 
programs at default settings, one of skill in the art can readily determine correspondmg 
fragments of the novel AAV capsids of the invention. 

Still other desirable AAV9/HU.14 proteins include the rep 

proteins include rcp68/78 and rep40/52. 

Suitably, fragments are at least 8 amino acids in length. However, 

fragments of other desired lengths may be readily utilized. Such fragments may be 
produced recombinantly or by other suitable means, e.g., chemical synthes.s. 

The invention further provides other AAV9/HU.14 sequences 
which are identified using the sequence information provided herein. For example, given 
the AAV9/HU 14 sequences provided herein, infectious AAV9/HU.14 may be .solated 
using genome walking technology (Siebert et al., 1995. Nucleic Acid Research, 23: 1 087- 
,088 Friezner-Degen etai, 1986, J. Biol. Chem. 261:6972-6985. BD Biosciences 
Clomech, Palo Alto. CA). Genome walking is particularly well suited for ident.tymg 
and isolating the sequences adjacent to the novel sequences identified according to the 
method of the invention. This technique is also useful for isolating inverted termunal 
repeat (ITRs) of the novel AAV9/HU.14 serotype, based upon the novel AAV caps.d and 

rep sequences provided herein. 
25 The sequences, proteins, and fragments of the invention may be 

produced by any suitable means, including recombinant production, chemical synthesis, 
or other synthetic means. Such production methods are within the knowledge of those o, 
skill in the art and are not a limitation of the present invention. 

30 III Production of rAAV with Novel AAV Capsids 

The invention encompasses novel AAV capsid sequences of which are free of 
DN A and/or cellular material with these viruses are associated in nature. To avoid 
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repeating all of the novel AAV capsids provided herein, reference is made throughout this 
and the following sections to the hu.l4/AAV9 capsid. However, it should be appreciated 
that the other novel AAV capsid sequences of the invention can be used in a similar 
manner. 

5 In another aspect, the present invention provides molecules that utilize the novel 

AAV sequences of the invention, including fragments thereof, for production of 
molecules useful in delivery of a heterologous gene or other nucleic acid sequences to a 
target cell. 

In another aspect, the present invention provides molecules that utilize the AAV 

10 sequences of the invention, including fragments thereof, for production of viral vectors 
useful in delivery of a heterologous gene or other nucleic acid sequences to a target cell. 

The molecules of the invention which contain AAV sequences include any 
genetic element (vector) which may be delivered to a host cell, e.g., naked DNA. a 
plasmid, phage, transposon, cosmid, episome, a protein in a non-viral delivery vehicle 

1 5 {e.g., a lipid-based carrier), virus, etc., which transfers the sequences carried thereon. 
The selected vector may be delivered by any suitable method, including transfection. 
electroporation, liposome delivery, membrane fusion techniques, high velocity DNA- 
coated pellets, viral infection and protoplast fusion. The methods used to construct any 
embodiment of this invention are known to those with skill in nucleic acid manipulation 

20 and include genetic engineering, recombinant engineering, and synthetic techniques. See, 
e,g,, Sambrook et al. Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Press, Cold Spring Harbor, NY. 

In one embodiment, the vectors of the invention contain, inter alia, sequences 
encoding an AAV capsid of the invention or a fragment thereof In another embodiment. 

25 the vectors of the invention contain, at a minimum, sequences encoding an AAV rep 
protein or a fragment thereof Optionally, vectors of the invention may contain both 
AAV cap and rep proteins. In vectors in which both AAV rep and cap are provided, the 
AAV rep and AAV cap sequences can originate from an AAV of the same clade. 
Alternatively, the present invention provides vectors in which the rep sequences are from 

30 an AAV source which differs from that which is providing the cap sequences. In one 
embodiment, the rep and cap sequences are expressed from separate sources (e.g., 
separate vectors, or a host cell and a vector). In another embodiment, these rep sequences 
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are fused in frame to cap sequences of a different AAV source to form a chimeric AAV 
vector. Optionally, the vectors of the invention are vectors packaged in an AAV capsid of 
the invention. These vectors and other vectors described herein can further contain a 
minigene comprising a selected transgene which is flanked by AAV 5' ITR and AAV 3' 
ITR. 

Thus, in one embodiment, the vectors described herein contain nucleic acid 
sequences encoding an intact AAV capsid which may be from a single AAV .^quence 
(e.g.. AAV9/HU.14). Such a capsid may comprise amino acids 1 to 736 of SEQ ID 
NO:'l23. Alternatively, these vectors contain sequences encoding artificial capsids which 
contain one or more fragments of the AAV9/HU.14 capsid fused to heterologous AAV or 
non-AAV capsid proteins (or fragments thereof). These artificial capsid proteins are 
selected from non-contiguous portions of the AAV9/HU.14 capsid or from capsids of 
other AAVs. For example, a rAAV may have a capsid protein comprising one or more ot 
the AAV9/HU.14 capsid regions selected from the vp2 and/or vp3, or from vp 1 , or 
15 fragments thereof selected from amino acids 1 to 184, amino acids 199 to 259; amino 
acids 274 to 446; amino acids 603 to 659; amino acids 670 to 706; amino acids 724 to 
738 of the AAV9/HU.14 capsid, SEQ ID NO: 123. In another example, ii may be 
desirable to alter the start codon of the vp3 protein to GTG. Alternatively, the rAAV may 
contain one or more of the AAV serotype 9 capsid protein hypervariable regions which 
20 are identified herein, or other fragment including, without limitation, aa 1 85 - 1 98; aa 

260-273; aa447-477; aa495-602; aa660-669; and aa707-723 of the AAV9/HU.14 capsid. 
See, SEQ ID NO: 123. These modifications may be to increase expression, yield, and/or 
to improve purification in the selected expression systems, or for another desired purpose 
{e.g., to change tropism or alter neutralizing antibody epitopes). 
25 ' The vectors described herein, e.g., a plasmid. are useful for a variety of purposes, 

but are particularly well suited for use in production of a rAAV containing a capsid 
comprising AAV sequences or a fragment thereof These vectors, including rAAV. their 
elements, construction, and uses are described in detail herein. 

in one aspect, the invention provides a method of generating a recombinant 
adeno-associated virus (AAV) having an AAV serotype 9 capsid. or a portion thereof 
Such a method involves cuhuring a host cell which contains a nucleic acid sequence 
encoding an AAV serotype 9 capsid protein, or fragment thereof, as defined herein; a 



30 



24 



wo 2005/033321 



PCT/US2004/028817 



functional rep gene; a minigene composed of, at a minimum, AAV inverted terminal 
repeats (ITRs) and a transgene; and sufficient helper functions to permit packaging of the 
minigene into the AAV9/HU.14 capsid protein. 

The components required to be cultured in the host cell to package an AAV 
5 minigene in an AAV capsid may be provided to the host cell in trans. Alternatively, any 
one or more of the required components {e.g., minigene, rep sequences, cap sequences, 
and/or helper functions) may be provided by a stable host cell which has been engineered 
to contain one or more of the required components using methods known to those of skill 
in the art. Most suitably, such a stable host cell will contain the required component(s) 
10 under the control of an inducible promoter. However, the required component(s) may be 
under the control of a constitutive promoter. Examples of suitable inducible and 
constitutive promoters are provided herein, in the discussion of regulatory elements 
suitable for use with the transgene. In still another alternative, a selected stable host cell 
may contain selected component(s) under the control of a constitutive promoter and other 
1 5 selected component(s) under the control of one or more inducible promoters. For 
example, a stable host cell may be generated which is derived from 293 cells (which 
contain El helper functions under the control of a constitutive promoter), but which 
contains the rep and/or cap proteins under the control of inducible promoters. Still other 
stable host cells may be generated by one of skill in the art. 
20 The minigene, rep sequences, cap sequences, and helper functions required for 

producing the rAAV of the invention may be delivered to the packaging host cell in the 
form of any genetic element which transfer the sequences carried thereon. The selected 
genetic element may be delivered by any suitable method, including those described 
herein. The methods used to construct any embodiment of this invention are known to 
25 those with skill in nucleic acid manipulation and include genetic engineering, 

recombinant engineering, and synthetic techniques. See, e.g., Sambrook et al, Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Press, Cold Spring Harbor, NY. 
Similarly, methods of generating rAAV virions are well known and the selection of a 
suitable method is not a limitation on the present invention. See, e.g., K. Fisher et al, J. 
30 Viroi, 70:520-532 (1993) and US Patent No. 5,478,745. 

Unless otherwise specified, the AAV ITRs, and other selected AAV components 
described herein, may be readily selected from among any AAV, including, without 
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limitation, AAVl, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV9 and one of the 
other novel AAV sequences of the invention. These ITRs or other AAV components may 
be readily isolated using techniques available to those of skill in the art from an AAV 
sequence. Such AAV may be isolated or obtained from academic, commercial, or public 
5 sources (e.g., the American Type Culture Collection, Manassas, VA). Alternatively, the 
AAV sequences may be obtained through synthetic or other suitable means by reference 
to published sequences such as are available in the literature or in databases such as, e.g., 
GenBank®, PubMed®, or the like. 

A. The Minigene 

10 The minigene is composed of, at a minimum, a transgene and its 

regulatory sequences, and 5' and 3' AAV inverted terminal repeats (ITRs). In one 
desirable embodiment, the ITRs of AAV serotype 2 are used. However, ITRs from other 
suitable sources may be selected. It is this minigene that is packaged into a capsid protein 
and delivered to a selected host cell. 

15 1. The transgene 

The transgene is a nucleic acid sequence, heterologous to 
the vector sequences flanking the transgene, which encodes a polypeptide, protein, or 
other product, of interest. The nucleic acid coding sequence is operatively linked to 
regulatory components in a manner which permits transgene transcription, translation, 

20 and/or expression in a host cell. 

The composition of the transgene sequence will depend 
upon the use to which the resulting vector will be put. For example, one type of 
transgene sequence includes a reporter sequence, which upon expression produces a 
detectable signal. Such reporter sequences include, without limitation, DNA sequences 

25 encoding p-lactamase, (3-galactosidase (LacZ), alkaline phosphatase, thymidine kinase, 
green fluorescent protein (GFP), enhanced GFP (EGFP), chloramphenicol 
acetyltransferase (CAT), luciferase, membrane bound proteins including, for example, 
CD2, CD4, CDS, the influenza hemagglutinin protein, and others well known in the art, 
to which high affinity antibodies directed thereto exist or can be produced by 

30 conventional means, and fusion proteins comprising a membrane bound protein 

appropriately fused to an antigen tag domain from, among others, hemagglutinin or Myc. 
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These coding sequences, when associated with regulatory 
elements which drive their expression, provide signals detectable by conventional means, 
including enzymatic, radiographic, colorimetric, fluorescence or other spectrographic 
assays, fluorescent activating cell sorting assays and immunological assays, including 

5 enzyme linked immunosorbent assay (ELISA), radioimmunoassay (RIA) and 

immunohistochemistry. For example, where the marker sequence is the LacZ gene, the 
presence of the vector carrying the signal is detected by assays for bela-galaciosidase 
activity. Where the transgene is green fluorescent protein or luciferase, the vector 
carrying the signal may be measured visually by color or light production in a 

10 luminometer. 

However, desirably, the transgene is a non-marker 
sequence encoding a product which is useful in biology and medicine, such as proteins, 
peptides, RNA, enzymes, dominant negative mutants, or catalytic RN As. Desirable 
RNA molecules include tRNA, dsRNA, ribosomal RNA. catalytic RNAs, siRNA, small 
1 5 hairpin RNA, trans-splicing RNA, and antisense RNAs. One example of a useful RNA 
sequence is a sequence which inhibits or extinguishes expression of a targeted nucleic 
acid sequence in the treated animal. Typically, suitable target sequences include 
oncologic targets and viral diseases. See, for examples of such targets the oncologic 
targets and viruses identified below in the section relating to immunogens. 

The transgene may be used to correct or ameliorate gene 
deficiencies, which may include deficiencies in which normal genes are expressed at less 
than normal levels or deficiencies in which the functional gene product is not expressed. 
Alternatively, the transgene may provide a product to a cell which is not natively 
expressed in the cell type or in the host. A preferred type of transgene sequence encodes 
25 a therapeutic protein or polypeptide which is expressed in a host cell. The invention 

further includes using multiple transgenes. In certain situations, a different transgene may 
be used to encode each subunit of a protein, or to encode different peptides or proteins. 
This is desirable when the size of the DNA encoding the protein subunit is large, e.g., for 
an immunoglobulin, the platelet-derived growth factor, or a dystrophin protein. In order 
30 for the cell to produce the multi-subunit protein, a cell is infected with the recombinant 
virus containing each of the different subunits. Alternatively, different subunits of a 
protein may be encoded by the same transgene. In this case, a single transgene includes 
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the DNA encoding each of the subunits, with the DNA for each subunit separated by an 
internal ribozyme entry site (IRES). This is desirable when the size of the DNA encoding 
each of the subunits is small, e.g., the total size of the DNA encoding the subunits and the 
IRES is less than five kilobases. As an alternative to an IRES, the DNA may be separated 
5 by sequences encoding a 2A peptide, which self-cleaves in a post-translationa! event. 

See, e.g., M.L. Donnelly, etal.J. Gen. Virol., 78(Pt 1): 13-21 (Jan 1997); Furler, S., et al. 
Gene Ther., 8(1 l):864-873 (June 2001); Klump H., al.. Gene Ther, 8(I0):8I 1-817 
(May 2001), This 2A peptide is significantly smaller than an IRES, making it well suited 
for use when space is a limiting factor. More often, when the transgene is large, consists 

10 of multi-subunits, or two transgenes are co-delivered, rAAV carrying the desired 

transgene(s) or subunits are co-administered to allow them to concatamerize in vivo to 
form a single vector genome. In such an embodiment, a first AAV may carry an 
expression cassette which expresses a single transgene and a second AAV may carry an 
expression cassette which expresses a different transgene for co-expression in the host 

1 5 cell. However, the selected transgene may encode any biologically active product or 
other product, e.g., a product desirable for study. 

Suitable transgenes may be readily selected by one of skill 
in the art. The selection of the transgene is not considered to be a limitation of this 
invention. 

20 2. Regulatory Elements 

In addition to the major elements identified above for the 
minigene, the vector also includes conventional control elements which are operably 
linked to the transgene in a manner which permits its transcription, translation and/or 
expression in a cell transfected with the plasmid vector or infected with the virus 

25 produced by the invention. As used herein, "operably linked" sequences include both 

expression control sequences that are contiguous with the gene of interest and expression 
control sequences that act in trans or at a distance to control the gene of interest. 

Expression control sequences include appropriate transcription 
initiation, termination, promoter and enhancer sequences; efficient RNA processing 

30 signals such as splicing and polyadenylation (polyA) signals; sequences that stabilize 

cytoplasmic mRNA; sequences that enhance translation efficiency {i.e., Kozak consensus 
sequence); sequences that enhance protein stability; and when desired, sequences that 
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enhance secretion of the encoded product. A great number of expression control 
sequences, including promoters which are native, constitutive, inducible and/or tissue- 
specific, are known in the art and may be utilized. 

Examples of constitutive promoters include, without 
5 limitation, the retroviral Rous sarcoma virus (RSV) LTR promoter (optionally with the 
RSV enhancer), the cytomegalovirus (CMV) promoter (optionally with the CMV 
enhancer) [see, e.g., Boshart et al. Cell, 41:521-530 (1985)], the SV40 promoter, the 
dihydrofolate reductase promoter, the p-actin promoter, the phosphoglycerol kinase 
(PGK) promoter, and the EFl promoter [Invitrogen]. Inducible promoters allow 
10 regulation of gene expression and can be regulated by exogenously supplied compounds, 
environmental factors such as temperature, or the presence of a specific physiological 
state, e.g., acute phase, a particular differentiation state of the cell, or in replicating cells 
only. Inducible promoters and inducible systems are available from a variety of 
commercial sources, including, without limitation, Invitrogen, Clontech and Ariad. Many 
15 other systems have been described and can be readily selected by one of skill in the art. 
Examples of inducible promoters regulated by exogenously supplied compounds, include, 
the zinc-inducible sheep metallothionine (MT) promoter, the dexamethasone (Dex)- 
inducible mouse mammary tumor virus (MMTV) promoter, the T7 polymerase promoter 
system [International Patent Publication No. WO 98/10088]; the ecdysone insect 
20 ^ron.oX^r^oetal,Proc.mtl.Acad.Sci. 93:3346-3351 (1996)]. the tetracycline- 
repressible system [Gossen et al, Proc. Natl. Acad. Sci. USA, 89:5547-555 1 (1 992)]. the 
tetracycline-inducible system [Gossen et al. Science, 268:1766-1769 (1995). see also 
Harvey et al, Curr. Opin. Chem. Biol., 2:512-518 (1998)], the RU486-inducible system 
[Wang et al, Nat. Biotech., 15:239-243 (1997) and Wang et al. Gene Ther., 4:432-441 
25 (1997)1 and the rapamycin-inducible system [Magari etal,J. Clin. Invest., 100:2865-2872 
(1997)]. Other types of inducible promoters which may be useful in this context are those 
which are regulated by a specific physiological state, e.g., temperature, acute phase, a 
particular differentiation state of the cell, or in replicating cells only. 

In another embodiment, the native promoter for the 
30 transgene will be used. The native promoter may be preferred when it is desired that 
expression of the transgene should mimic the native expression. The native promoter 
may be used when expression of the transgene must be regulated temporally or 
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developmental ly, or in a tissue-specific manner, or in response to specific transcriptional 
stimuli. In a further embodiment, other native expression control elements, such as 
enhancer elements, polyadenylation sites or Kozak consensus sequences may also be used 
to mimic the native expression. 

Another embodiment of the transgene includes a gene 
operably linked to a tissue-specific promoter. For instance, if expression in skeletal 
muscle is desired, a promoter active in muscle should be used. These include the 
promoters from genes encoding skeletal p-actin, myosin light chain 2A, dystrophin, 
muscle creatine kinase, as well as synthetic muscle promoters with activities higher than 
naturally-occurring promoters (see Li et ai, Nat. Biotech., 17:241-245 (1999)). Examples 
of promoters that are tissue-specific are known for liver (albumin, Miyatake et al., J. 
Virol., 71:5124-32 (1997); hepatitis B virus core promoter, Sandig et al.. Gene Ther., 
3:1002-9 (1996); alpha-fetoprotein (AFP), Arbuthnot er/ o/., Hum, Gene Ther, 7:1503-14 
(1996)), bone osteocalcin (Stein et aL, MoL Bid Rep,, 24:185-96 (1997)); bone 
15 sialoprotein (Chen et ai, J, Bone Miner. Res., 11:654-64 (1996)), lymphocytes (CD2, 
Hansalera/.,y, Immunol., 161:1063-8(1998); immunoglobulin heavy chain; T cell 
receptor chain), neuronal such as neuron-specific enolase (NSE) promoter (Andersen et 
ai. Cell. Moi NeurobioL, 13:503-15 (1993)), neurofilament light-chain gene (Piccioli et 
aL, Proc. Natl. Acad. Set. USA, 88:561 1-5 (1991)), and the neuron-specific vgf gene 
20 (Piccioli et aL, Neuron, 15:373-84 (1995)), among others. 

Optionally, plasmids carrying therapeutically useful 
transgenes may also include selectable markers or reporter genes may include sequences 
encoding geneticin, hygromicin or purimycin resistance, among others. Such selectable 
reporters or marker genes (preferably located outside the viral genome to be rescued by 
25 the method of the invention) can be used to signal the presence of the plasmids in 
bacterial cells, such as ampicillin resistance. Other components of the plasm id may 
include an origin of replication. Selection of these and other promoters and vector 
elements are conventional and many such sequences are available [see, ef.g., Sambrook et 
al, and references cited therein]. 

^0 The combination of the transgene, promoter/enhancer, and 

5' and 3' AAV ITRs is referred to as a "minigene" for ease of reference herein. Provided 
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with the teachings of this invention, the design of such a minigene can be made by resort 
to conventional techniques. 

3. Delivery of the Minigene to a Packaging Host Cell 

The minigene can be carried on any suitable vector, e.g., a 
5 plasmid, which is delivered to a host cell. The plasmids useful in this invention may be 
engineered such that they are suitable for replication and, optionally, integration in 
prokaryotic cells, mammalian cells, or both. These plasmids (or other vectors carrying 
the 5' AAV ITR-heterologous molecule-3' AAV ITR) contain sequences permitting 
replication of the minigene in eukaryotes and/or prokaryotes and selection markers for 
10 these systems. Selectable markers or reporter genes may include sequences encoding 
geneticin, hygromicin or purimycin resistance, among others. The plasmids may also 
contain certain selectable reporters or marker genes that can be used to signal the 
presence of the vector in bacterial cells, such as ampicillin resistance. Other components 
of the plasmid may include an origin of replication and an amplicon, such as the amplicon 

15 system employing the Epstein Barr virus nuclear antigen. This amplicon system, or other 
similar amplicon components permit high copy episomal replication in the cells. 
Preferably, the molecule carrying the minigene is transfected into the cell, where it may 
exist transiently. Alternatively, the minigene (canying the 5' AAV ITR-heterologous 
molecule-3* ITR) may be stably integrated into the genome of the host cell, either 

20 chromosomally or as an episome. In certain embodiments, the minigene may be present 
in multiple copies, optionally in head-to-head, head-to-tail, or tail-to-tail concatamers. 
Suitable transfection techniques are known and may readily be utilized to deliver the 
minigene to the host cell. 

Generally, when delivering the vector comprising the minigene by 

25 transfection, the vector is delivered in an amount from about 5 ng to about 1 00 ^ig DNA, 
about 10 ^g to about 50 ^g DNA to about 1x10^ cells to about 1 x lO'^ cells, or about 
I X 1 0^ cells. However, the relative amounts of vector DNA to host cells may be 
adjusted, taking into consideration such factors as the selected vector, the delivery method 
and the host cells selected. 

30 B. Rep and Cap Sequences 

In addition to the minigene, the host cell contains the sequences 
which drive expression of a novel AAV capsid protein of the invention (or a capsid 
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protein comprising a fragment thereoO in the host cell and rep sequences of the same 
source as the source of the AAV ITRs found in the minigene, or a cross-complementing 
source. The AAV cap and rep sequences may be independently obtained from an AAV 
source as described above and may be introduced into the host cell in any manner known 
5 to one in the art as described above. Additionally, when pseudotyping an AAV vector in 
{e.g., an AAV9/HU.14 capsid), the sequences encoding each of the essential rep proteins 
may be supplied by different AAV sources {e.g., AAVl, AAV2, AAV3, AAV4, AAV5, 
AAV6, AAV7, AAV8). For example, the rep7S/6S sequences may be from AAV2, 
whereas the rep52/40 sequences may be from AAV8- 
10 In one embodiment, the host cell stably contains the capsid protein 

under the control of a suitable promoter, such as those described above. Most desirably, 
in this embodiment, the capsid protein is expressed under the control of an inducible 
promoter. In another embodiment, the capsid protein is supplied to the host cell in trans. 
When delivered to the host cell in trans, the capsid protein may be delivered via a 
1 5 plasmid which contains the sequences necessary to direct expression of the selected 

capsid protein in the host cell. Most desirably, when delivered to the host cell in trans, 
the plasmid carrying the capsid protein also carries other sequences required for 
packaging the rAAV, e.g., the rep sequences. 

In another embodiment, the host cell stably contains the rep 
20 sequences under the control of a suitable promoter, such as those described above. Most 
desirably, in this embodiment, the essential rep proteins are expressed under the control 
of an inducible promoter. In another embodiment, the rep proteins are supplied to the 
host cell in trans. When delivered to the host cell in trans, the rep proteins may be 
delivered via a plasmid which contains the sequences necessary to direct expression of 
25 the selected rep proteins in the host cell. Most desirably, when delivered to the host cell 
in trans, the plasmid carrying the capsid protein also carries other sequences required for 
packaging the rAAV, e.g,, the rep and cap sequences. 

Thus, in one embodiment, the rep and cap sequences may be 
transfected into the host cell on a single nucleic acid molecule and exist stably in the cell 
30 as an episome. In another embodiment, the rep and cap sequences are stably integrated 
into the chromosome of the cell. Another embodiment has the rep and cap sequences 
transiently expressed in the host cell. For example, a useful nucleic acid molecule for 
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such transfection comprises, from 5' to 3\ a promoter, an optional spacer interposed 
between the promoter and the start site of the rep gene sequence, an AAV rep gene 
sequence, and an AAV cap gene sequence. 

Optionally, the rep and/or cap sequences may be supplied on a 
5 vector that contains other DNA sequences that are to be introduced into the host cells. 
For instance, the vector may contain the rAAV construct comprising the minigene. The 
vector may comprise one or more of the genes encoding the helper functions, e.g., the 
adenoviral proteins El, E2a, and E4 0RF6, and the gene for VAI RNA. 

Preferably, the promoter used in this construct may be any of the 
10 constitutive, inducible or native promoters known to one of skill in the art or as discussed 
above. In one embodiment, an AAV P5 promoter sequence is employed. The selection 
of the AAV to provide any of these sequences does not limit the invention. 

In another preferred embodiment, the promoter for rep is an 
inducible promoter, such as are discussed above in connection with the transgene 
1 5 regulatory elements. One preferred promoter for rep expression is the T7 promoter. The 
vector comprising the rep gene regulated by the T7 promoter and the cap gene, is 
transfected or transformed into a cell which either constitutively or inducibly expresses 
the T7 polymerase. See International Patent Publication No. WO 98/10088, published 
March 12, 1998. 

20 The spacer is an optional element in the design of the vector. The 

spacer is a DNA sequence interposed between the promoter and the rep gene ATG start 
site. The spacer may have any desired design; that is, it may be a random sequence of 
nucleotides, or alternatively, it may encode a gene product, such as a marker gene. The 
spacer may contain genes which typically incorporate start/stop and polyA sites. The 

25 spacer may be a non-coding DNA sequence from a prokaryote or eukaryote, a repetitive 
non-coding sequence, a coding sequence without transcriptional controls or a coding 
sequence with transcriptional controls. Two exemplary sources of spacer sequences are 
the phage ladder sequences or yeast ladder sequences, which are available commercially, 
e.g., from Gibco or Invitrogen, among others. The spacer may be of any size sufficient to 

30 reduce expression of the replS and rep6S gene products, leaving the rep52, rep40 and 
cap gene products expressed at normal levels. The length of the spacer may therefore 
range from about 10 bp to about 10.0 kbp, preferably in the range of about 100 bp to 
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about 8.0 kbp. To reduce the possibility of recombination, the spacer is preferably less 
than 2 kbp in length; however, the invention is not so limited. 

Although the molecule(s) providing rep and cap may exist in the 
host cell transiently {i.e., through transfection), it is preferred that one or both of the rep 
5 and cap proteins and the promoter(s) controlling their expression be stably expressed in 
the host cell, e.g., as an episome or by integration into the chromosome of the host cell. 
The methods employed for constructing embodiments of this invention are conventional 
genetic engineering or recombinant engineering techniques such as those described in the 
references above. While this specification provides illustrative examples of specific 
1 0 constructs, using the information provided herein, one of skill in the art may select and 
design other suitable constructs, using a choice of spacers, P5 promoters, and other 
elements, including at least one translational start and stop signal, and the optional 
addition of polyadenylation sites. 

In another embodiment of this invention, the rep or cap protein 
1 5 may be provided stably by a host cell. 

C. The Helper Functions 

The packaging host cell also requires helper functions in order to 
package the rAAV of the invention. Optionally, these functions may be supplied by a 
herpesvirus. Most desirably, the necessary helper functions are each provided from a 
20 human or non-human primate adenovirus source, such as those described above and/or 
are available from a variety of sources, including the American Type Culture Collection 
(ATCC), Manassas, VA (US). In one currently preferred embodiment, the host cell is 
provided with and/or contains an El a gene product, an El b gene product, an E2a gene 
product, and/or an E4 ORF6 gene product. The host cell may contain other adenoviral 
25 genes such as VAl RNA, but these genes are not required. In a preferred embodiment, no 
other adenovirus genes or gene functions are present in the host cell. 

By "adenoviral DNA which expresses the Ela gene product", it is 
meant any adenovirus sequence encoding Ela or any functional Ela portion. Adenoviral 
DNA which expresses the E2a gene product and adenoviral DN A which expresses the E4 
30 0RF6 gene products are defined similarly. Also included are any alleles or other 

modifications of the adenoviral gene or functional portion thereof Such modifications 
may be deliberately introduced by resort to conventional genetic engineering or 
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mutagenic techniques to enhance the adenoviral function in some manner, as well as 
naturally occurring allelic variants thereof. Such modifications and methods for 
manipulating DNA to achieve these adenovirus gene functions are known to those of skill 
in the art. 

5 The adenovirus Ela, Elb, E2a, and/or E40RF6 gene products, as 

well as any other desired helper functions, can be provided using any means that allows 
their expression in a cell. Each of the sequences encoding these products may be on a 
separate vector, or one or more genes may be on the same vector. The vector may be any 
vector known in the art or disclosed above, including plasmids, cosmids and viruses. 

10 Introduction into the host cell of the vector may be achieved by any means known in the 
art or as disclosed above, including transfection, infection, electroporation, liposome 
delivery, membrane fusion techniques, high velocity DNA-coated pellets, viral infection 
and protoplast fusion, among others. One or more of the adenoviral genes may be stably 
integrated into the genome of the host cell, stably expressed as episomes, or expressed 

15 transiently. The gene products may all be expressed transiently, on an episome or stably 
integrated, or some of the gene products may be expressed stably while others are 
expressed transiently. Furthermore, the promoters for each of the adenoviral genes may 
be selected independently from a constitutive promoter, an inducible promoter or a native 
adenoviral promoter. The promoters may be regulated by a specific physiological state of 

20 the organism or cell {i.e., by the differentiation state or in replicating or quiescent cells) or 
by exogenously added factors, for example. 

D. Host Cells And Packaging Cell Lines 

The host cell itself may be selected from any biological organism, 
including prokaryotic (e.g., bacterial) cells, and eukaryotic cells, including, insect cells, 

25 yeast cells and mammalian cells. Particularly desirable host cells are selected from 

among any mammalian species, including, without limitation, cells such as A549, WEHL 
3T3, lOTI/2, BHK, MDCK, COS 1, COS 7, BSC 1, BSC 40, BMT 10, VERO, WI38, 
HeLa, 293 cells (which express functional adenoviral El), Saos, C2C12, L cells, HTI080, 
HepG2 and primary fibroblast, hepatocyte and myoblast cells derived from mammals 

30 including human, monkey, mouse, rat, rabbit, and hamster. The selection of the 

mammalian species providing the cells is not a limitation of this invention; nor is the type 
of mammalian cell, i.e., fibroblast, hepatocyte, tumor cell, etc. The requirements for the 
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cell used is that it not carry any adenovirus gene other than El, E2a and/or E4 0RF6: it 
not contain any other virus gene which could result in homologous recombination of a 
contaminating virus during the production of rAAV; and it is capable of infection or 
transfection of DNA and expression of the transfectcd DNA. in a preferred cmboJimcni. 
5 the host cell is one that has rep and cap stably transfected in the cell. 

One host cell useful in the present invention is a host cell stably 
transformed with the sequences encoding rep and cap, and which is transfected with the 
adenovirus El , E2a, and E40RF6 DNA and a construct carrying the minigene as 
described above. Stable rep and/or cap expressing cell lines, such as B-50 (International 
10 Patent Application Publication No. WO 99/15685), or those described in US Patent No. 
5,658,785, may also be similarly employed. Another desirable host cell contains the 
minimum adenoviral DNA which is sufficient to express E4 0RF6. Yet other cell lines 
can be constructed using the novel AAV9 cap sequences of the invention. 

The preparation of a host cell according to this invention involves 
15 techniques such as assembly of selected DNA sequences. This assembly may be 

accomplished utilizing conventional techniques. Such techniques include cDNA and 
genomic cloning, which are well known and are described in Sambrook et at., cited 
above, use of overlapping oligonucleotide sequences of the adenovirus and AAV 
genomes, combined with polymerase chain reaction, synthetic methods, and any other 
20 suitable methods which provide the desired nucleotide sequence. 

Introduction of the molecules (as plasmids or viruses) into the host 
cell may also be accomplished using techniques known to the skilled arti.san and as 
discussed throughout the specification. In preferred embodiment, standard transfection 
techniques are used, e.g., CaP04 transfection or electroporation, and/or infection by 
25 hybrid adenovirus/AAV vectors into cell lines such as the human embryonic kidney cell 
line HEK 293 (a human kidney cell line containing functional adenovirus Fl iicnes which 
provides /ra/7.y-acting El proteins). 

The AAV9/HU.14 based vectors which are generated by one of 
skill in the art are beneficial for gene delivery to selected host cells and gene therapy 
30 patients since no neutralization antibodies to AAV9/HU.14 have been found in the human 
population. One of skill in the art may readily prepare other rAAV viral vectors 
containing the AAV9/HU.14 capsid proteins provided herein using a variety of 
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techniques known to those of skill in the art. One may similarly prepare still other rAA V 
viral vectors containing AAV9/HU.I4 sequence and AAV capsids from another source. 

One of skill in the an will readily understand ihnl ihc no\ cl AAV 
sequences of the invention can be readily adapted for use in these and other viral vector 
5 systems for in vitro, ex vivo or in vivo gene delivery. Similarly, one of skill in the art can 
readily select other fragments of the AAV genome of the invention for use in a variety of 
rAAV and non-rAAV vector systems. Such vectors systems may include, 
Antiviruses, retroviruses, poxviruses, vaccinia viruses, and adenoviral systems, amony 
others. Selection of these vector systems is not a limitation of the present invention. 

10 Thus, the invention further provides vectors generated using the 

nucleic acid and amino acid sequences of the novel AAV of the invention. Such vectors 
are useful for a variety of purposes, including for delivery of therapeutic molecules and 
for use in vaccine regimens. Particularly desirable for delivery of therapeutic molecules 
are recombinant AAV containing capsids of the novel AAV of the invention. These, or 

15 other vector constructs containing novel AAV sequences of the invention may be used in 
vaccine regimens, eg,, for co-delivery of a cytokine, or for delivery of the immunogcn 
itself 

IV. Recombinant Viruses And Uses Therefor 

20 Using the techniques described herein, one of skill in the ait can generate a rAAV 

having a capsid of an AAV of the invention or having a capsid containing one or more 
fragments of an AAV of the invention. In one embodiment, a full-length capsid from a 
single AAV, e.g., hu.l4/AAV9 [SEQ ID NO: 123] can be utilized. In another 
embodiment, a full-length capsid may be generated which contains one or more 

25 fragments of the novel AAV capsid of the invention iiised in frame vviih sequences from 
another selected AAV, or from heterologous {i.e., non-contiguous) portions of the same 
AAV. For example, a rAAV may contain one or more of the novel hypervariable region 
sequences of AAV9/HU.14. Alternatively, the unique AAV sequences of the invcnii(>n 
may be used in constructs containing other viral or non-viral sequences. Optionally, a 

30 recombinant virus, may carry AAV rep sequences encoding one or more of the AAV rep 
proteins. 
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A. Delivery of Viruses 

In another aspect, the present invention provides a method for delivery of 
a transgene to a host which involves transfecting or infecting a selected host cell with a 
recombinant viral vector generated with the AAV9/HU, 14 sequences (or functional 
5 fragments thereoQ of the invention. Methods for delivery are well known lo ihose of skill 
in the art and are not a limitation of the present invention. 

In one desirable embodiment, the invention provides ;i method fbr AA V- 
mediated delivery of a transgene to a host. This method involves transfecting or inlecting 
a selected host cell with a recombinant viral vector containing a selected transgene under 
10 the control of sequences that direct expression thereof and AAV9 capsid proteins. 

Optionally, a sample from the host may be first assayed for the presence 
of antibodies to a selected AAV source (e.g., a serotype). A variety of assay formats for 
detecting neutralizing antibodies are well known to those of skill in the an. The .selection 
of such an assay is not a limitation of the present invention. See, e.g.. Fisher et al. Nature 
1 5 Med , 3(3):306-3 1 2 (March 1 997) and W. C. Manning et al. Human Gene Therapy, 
9:477-485 (March 1, 1998). The results of this assay may be used to determine which 
AAV vector containing capsid proteins of a particular source are preferred for delivery. 
e.g., by the absence of neutralizing antibodies specific for that capsid source. 

In one aspect of this method, the delivery of vector with AAV capsid 
20 proteins of the invention may precede or follow delivery of a gene via a vector with a 
different AAV capsid protein. Thus, gene delivery via rAAV vectors may be used for 
repeat gene delivery to a selected host cell. Desirably, subsequently administered rAAV 
vectors carry the same transgene as the first rAAV vector, but the subsequently 
administered vectors contain capsid proteins of sources (and preferably, different 
25 serotypes) which differ from the first vector. For example, if a first vector has 

AAV9/HU.14 capsid proteins, subsequently administered vectors may have capsid 
proteins selected from among the other AAV, optionally, from another serotype or from 
another clade. 

Optionally, multiple rAAV vectors can be used lo deliver large iransgeiies. 
30 or multiple transgenes by co-administration of rAAV vectors concatamerize in vivo to 
form a single vector genome. In such an embodiment, a first AAV may carry an 
expression cassette which expresses a single transgene (or a subunit thereol) and a second 
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AAV may carry an expression cassette which expresses a second transgenc (or a d.llerent 
subunit) for co-expression in the host cell. A f.rst AAV may carry an express.on cassette 
which is a first piece of a polycistronic construct (^.g.. a promoter and transgenc. or 
subunit) and a second AAV may carry an expression cassette which is a second p.ece ot a 
5 polycistronic construct (..g., transgene or subunit and a polyA sequence). The.c two 
pieces of a polycistronic construct concatamerize /. v/vo to form a single vector genome 
that co-expresses the transgenes delivered by the first and second AAV. In such 
embodiments, the rAAV vector carrying the f.rst expression cassette and the rA A V vector 
carrying the second expression cassette can be delivered in a single pharmaccut.c.l 
10 composition. In other embodiments, the two or more rAAV vectors are delivered ns 
separate pharmaceutical compositions which can be administered substantially 
simultaneously, or shortly before or after one another. 

The above-described recombinant vectors may be delivered to host cells 
according to published methods. The rAAV, preferably suspended in a physiologically 
1 5 compatible carrier, may be administered to a human or non-human mammalian pat.ent. 
Suitable carriers may be readily selected by one of skill in the a., in view of the .nd.cat.on 
for which the transfer virus is directed. For example, one suitable carrier includes sahne, 
which may be formulated with a variety of buffering solutions (eg., phosphate buttered 
saline) Other exemplary carriers include sterile saline, lactose, sucrose, calcium 
20 phosphate, gelatin, dextran, agar, pectin, peanut oil, sesame oil, and water. The select.on 
of the carrier is not a limitation of the present invention. 

Optionally, the compositions of the invemion may contain, in addition to 
the rAAV and carrier(s), other conventional pharmaceutical ingredients, such as 
preservatives, or chemical stabilizers. Suitable exemplary preservatives include 
25 chlorobutanol, potassium sorbate, sorbic acid, sulfur dioxide, propyl gallate, the parabens. 
ethyl vanillin, glycerin, phenol, and parachlorophenol. Suitable chemical stab.l.zers 

include gelatin and albumin. 

The vectors are administered in sufficient amounts to Iranslecl the cells 
and to provide sufficient levels of gene transfer and expression to provide a therapetitic 
30 benefit without undue adverse effects, or with medically acceptable physiological etlects, 
which can be determined by those skilled in the medical arts. Conventional and 
pharmaceutically acceptable routes of admin.stration include, but are not limited to. dnect 
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delivery to a desired organ (e.g., the liver (optionally via the hepatic artery) or lung), oral, 
inhalation, intranasal, intratracheal, intraarterial, intraocular, intravenous, intramuscular, 
subcutaneous, intradermal, and other parental routes of administration. Routes of 
administration may be combined, if desired. 
5 Dosages of the viral vector will depend primarily on factors such as the 

condition being treated, the age, weight and health of the patient, and may thus vary 
among patients. For example, a therapeutically effective human dosage of the viral vector 
is generally in the range of from about 0.1 mL to about 100 mL of solution containing 
concentrations of from about 1 x 10^ to 1 x 10^^ genomes virus vector. A preferred 

1 0 human dosage for delivery to large organs (e.g., liver, muscle, heart and lung) may be 

about 5x10'^ to 5 X lO'^ AAV genomes per 1 kg, at a volume of about 1 to 100 mL. A 
preferred dosage for delivery to eye is about 5 x 10^ to 5 x 10*^ genome copies, at a 
volume of about 0.1 mL to 1 mL. The dosage will be adjusted to balance the therapeutic 
benefit against any side effects and such dosages may vary depending upon the 

15 therapeutic application for which the recombinant vector is employed. The levels of 
expression of the transgene can be monitored to determine the frequency of dosage 
resulting in viral vectors, preferably AAV vectors containing the minigene. Optionally, 
dosage regimens similar to those described for therapeutic purposes may be utilized for 
immunization using the compositions of the invention. 

20 Examples of therapeutic products and immunogenic products for delivery 

by the AA V-containing vectors of the invention are provided below. These vectors may 
be used for a variety of therapeutic or vaccinal regimens, as described herein. 
Additionally, these vectors may be delivered in combination with one or more other 
vectors or active ingredients in a desired therapeutic and/or vaccinal regimen. 

25 B. Therapeutic Transgenes 

Useful therapeutic products encoded by the transgene include hormones 
and growth and differentiation factors including, without limitation, insulin, glucagon, 
growth hormone (GH), parathyroid hormone (PTH), growth hormone releasing factor 
(GRF), follicle stimulating hormone (FSH), luteinizing hormone (LH), human chorionic 

30 gonadotropin (hCG), vascular endothelial growth factor (VEGF), angiopoietins, 
angiostatin, granulocyte colony stimulating factor (GCSF), erythropoietin (EPO), 
connective tissue growth factor (CTGF), basic fibroblast growth factor (bFGF), acidic 
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fibroblast growth factor (aFGF), epidermal growth factor (EGF), platelet-derived growth 
factor (PDGF), insulin growth factors I and II (IGF-1 and IGF-IJ), any one of the 
transforming growth factor a superfamily, including TGFa, activins, inhibins, or any of 
the bone morphogenic proteins (BMP) BMPs 1-15, any one of the 
5 heregluin/neuregulin/ARlA/neu differentiation factor (NDF) family of growth factors, 
nerve growth factor (NGF), brain-derived neurotrophic factor (BDNF), neurotrophins 
NT-3 and NT-4/5, ciliary neurotrophic factor (CNTF), glial cell line derived neurotrophic 
factor (GDNF), neurturin, agrin, any one of the family of semaphorins/collapsins, netrin-1 
and netrin-2, hepatocyte growth factor (HGF), ephrins, noggin, sonic hedgehog and 

10 tyrosine hydroxylase. 

Other useful transgene products include proteins that regulate the immune 
system including, without limitation, cytokines and lymphokines such as thrombopoietin 
(TPO), interleukins (IL) lL-1 through IL-25 (including, e.g., IL-2, lL-4, IL-12 and lL-18), 
monocyte chemoattractant protein, leukemia inhibitory factor, granulocyte-macrophage 

15 colony stimulating factor. Fas ligand, tumor necrosis factors a and p, interferons a, p, 
and Y, stem cell factor, flk-2/flt3 ligand. Gene products produced by the immune system 
are also useful in the invention. These include, without limitations, immunoglobulins 
IgG, IgM, IgA, IgD and IgE, chimeric immunoglobulins, humanized antibodies, single 
chain antibodies, T cell receptors, chimeric T cell receptors, single chain T cell receptors, 

20 class I and class 11 MHC molecules, as well as engineered immunoglobulins and MHC 
molecules. Useful gene products also include complement regulatory proteins such as 
complement regulatory proteins, membrane cofactor protein (MCP), decay accelerating 
factor (DAF), CRl, CF2 and CD59. 

Still other useful gene products include any one of the receptors for the 

25 hormones, growth factors, cytokines, lymphokines, regulatory proteins and immune 

system proteins. The invention encompasses receptors for cholesterol regulation and/or 
lipid modulation, including the low density lipoprotein (LDL) receptor, high density 
lipoprotein (HDL) receptor, the very low density lipoprotein (VLDL) receptor, and 
scavenger receptors. The invention also encompasses gene products such as members of 

30 the steroid hormone receptor superfamily including glucocorticoid receptors and estrogen 
receptors. Vitamin D receptors and other nuclear receptors. In addition, useful gene 
products include transcription factors such as Jun,fos, max, mad, serum response factor 
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(SRF), AP-1, AP2, myb, MyoD and myogenin, ETS-box containing proteins, TFE3, E2F, 
ATFl, ATF2, ATF3, ATF4, ZF5, NFAT, CREB, HNF-4, C/EBP, SPl, CCAAT-box 
binding proteins, interferon regulation factor (IRf-1), Wilms tumor protein, ETS-binding 
protein, STAT, GATA-box binding proteins, e.g., GATA-3, and the forkhead family of 
5 winged helix proteins. 

Other useful gene products include, carbamoyl synthetase 1, ornithine 
transcarbamylase, arginosuccinate synthetase, arginosuccinate lyase, arginase, 
fumarylacetacetate hydrolase, phenylalanine hydroxylase, alpha- 1 antitrypsin, glucose-6- 
phosphatase, porphobilinogen deaminase, cystathione beta-synthase, branched chain 
0 ketoacid decarboxylase, albumin, isovaleryUcoA dehydrogenase, propionyl CoA 

carboxylase, methyl malonyl CoA mutase, glutaryl CoA dehydrogenase, insulin, beta- 
glucosidase, pyruvate carboxylate, hepatic phosphorylase, phosphorylase kinase, glycine 
decarboxylase, H-protein, T-protein, a cystic fibrosis transmembrane regulator (CFTR) 
sequence, and a dystrophin gene product [e.g., a mini- or micro-dystrophin]. Still other 
5 useful gene products include enzymes such as may be useful in enzyme replacement 
therapy, which is useful in a variety of conditions resulting from deficient activity of 
enzyme. For example, enzymes that contain mannose-6-phosphate may be utilized in 
therapies for lysosomal storage diseases (e.g., a suitable gene includes that encoding p- 
glucuronidase (GUSB)). 

Still other useful gene products include those used for treatment of 
hemophilia, including hemophilia B (including Factor IX) and hemophilia A (including 
Factor Vlll and its variants, such as the light chain and heavy chain of the heterodimer 
and the B-deleted domain; US Patent No. 6,200,560 and US Patent No. 6,221,349). The 
Factor VIIl gene codes for 2351 amino acids and the protein has six domains, designated 
from the amino to the terminal carboxy terminus as A1-A2-B-A3-C1-C2 [Wood et al, 
Nature, 312.330 (1984); Vehar et at., Nature 312:337 (1984); and Toole et al. Nature, 
342:337 (1984)]. Human Factor Vlll is processed within the cell to yield a heterodimer 
primarily comprising a heavy chain containing the Al, A2 and B domains and a light 
chain containing the A3, CI and C2 domains. Both the single chain polypeptide and the 
heterodimer circulate in the plasma as inactive precursors, until activated by thrombin 
cleavage between the A2 and B domains, which releases the B domain and results in a 
heavy chain consisting of the Al and A2 domains. The B domain is deleted in the 
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activated procoagulant form of the protein. Additionally, in the native protein, two 
polypeptide chains ("a" and "b"), flanking the B domain, are bound to a divalent calcium 
cation. 

In some embodiments, the minigene comprises first 57 base pairs of the 
5 Factor Vlll heavy chain which encodes the 10 amino acid signal sequence, as well as the 
human growth hormone (hGH) polyadenylation sequence. In alternative embodiments, 
the minigene further comprises the A I and A2 domains, as well as 5 amino acids from the 
Isl-terminus of the B domain, and/or 85 amino acids of the C-terminus of the B domain, as 
well as the A3, CI and C2 domains. In yet other embodiments, the nucleic acids 
10 encoding Factor VIII heavy chain and light chain are provided in a single minigene 

separated by 42 nucleic acids coding for 14 amino acids of the B domain [US Patent No. 
6,200,560]. 

As used herein, a therapeutically effective amount is an amount of AAV 
vector that produces sufficient amounts of Factor VIII to decrease the time it takes for a 

1 5 subject's blood to clot. Generally, severe hemophiliacs having less than 1% of normal 
levels of Factor Vlll have a whole blood clotting time of greater than 60 minutes as 
compared to approximately 10 minutes for non-hemophiliacs. 

The present invention is not limited to any specific Factor Vlll sequence. 
Many natural and recombinant forms of Factor Vlll have been isolated and generated. 

20 Examples of naturally occurring and recombinant forms of Factor Vll can be found in the 
patent and scientific literature including, US Patent No. 5,563,045, US Patent No. 
5,451,521, US Patent No. 5,422,260, US Patent No. 5,004,803, US Patent No. 4,757,006, 
US Patent No. 5,661,008, US Patent No. 5,789,203, US Patent No. 5,681,746, US Patent 
No. 5,595,886, US Patent No. 5,045,455, US Patent No. 5,668,108, US Patent No. 

25 5,633,150, US Patent No. 5,693,499, US Patent No. 5,587,310, US Patent No. 5,171,844, 
US Patent No. 5,149,637, US Patent No, 5,1 12,950, US Patent No. 4,886,876; 
International Patent Publication Nos. WO 94/1 1503, WO 87/07144, WO 92/16557, WO 
91/09122, WO 97/03195, WO 96/21035, and WO 91/07490; European Patent 
Application Nos. EP 0 672 138, EP 0 270 618, EP 0 182 448, EP 0 162 067, EP 0 786 

30 474, EP 0 533 862, EP 0 506 757, EP 0 874 057,EP 0 795 021, EP 0 670 332, EP 0 500 
734, EP 0 232 1 12, and EP 0 160 457; Sanberg et ah, XXth Int. Congress of the World 
Fed. Of Hemophilia (1992), and Lind etal,, Eur. J. Biochem., 232:19 (1995), 
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Nucleic acids sequences coding for the above-described Factor VllI can 
be obtained using recombinant methods or by deriving the sequence from a vector known 
to include the same. Furthermore, the desired sequence can be isolated directly from cells 
and tissues containing the same, using standard techniques, such as phenol extraction and 
5 PCR of cDNA or genomic DNA [See, e.g., Sambrook et al]. Nucleotide sequences can 
also be produced synthetically, rather than cloned. The complete sequence can be 
assembled from overlapping oligonucleotides prepared by standard methods and 
assembled into a complete coding sequence [See, e.g.. Edge, Nature 292:757 (1981); 
Nambari et al, Science, 223:1299 (1984); and Jay el al, J. Biol. Chem. 259:631 1 (1984). 

10 Furthermore, the invention is not limited to human Factor VIII. Indeed, it 

is intended that the present invention encompass Factor VIU from animals other than 
humans, including but not limited to companion animals {e.g., canine, felines, and 
equines), livestock {e.g., bovines, caprines and ovines), laboratory animals, marine 
mammals, large cats, etc. 

1 5 The AAV vectors may contain a nucleic acid coding for fragments of 

Factor VIII which is itself not biologically active, yet when administered into the subject 
improves or restores the blood clotting time. For example, as discussed above, the Factor 
VIll protein comprises two polypeptide chains: a heavy chain and a light chain separated 
by a B-domain which is cleaved during processing. As demonstrated by the present 

20 invention, co-tranducing recipient cells with the Factor Vlll heavy and light chains leads 
to the expression of biologically active Factor VIII. Because most hemophiliacs contain a 
mutation or deletion in only one of the chains {e.g., heavy or light chain), it may be 
possible to administer only the chain defective in the patient to supply the other chain. 

Other useful gene products include non-naturally occurring polypeptides, 

25 such as chimeric or hybrid polypeptides having a non-naturally occurring amino acid 
sequence containing insertions, deletions or amino acid substitutions. For example, 
single-chain engineered immunoglobulins could be useful in certain 
immunocompromised patients. Other types of non-naturally occurring gene sequences 
include antisense molecules and catalytic nucleic acids, such as ribozymes, which could 

30 be used to reduce overexpression of a target. 

Reduction and/or modulation of expression of a gene is particularly 
desirable for treatment of hyperproliferative conditions characterized by 
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hyperproliferating cells, as are cancers and psoriasis. Target polypeptides include those 
polypeptides which are produced exclusively or at higher levels in hyperproliferative cells 
as compared to normal cells. Target antigens include polypeptides encoded by oncogenes 
such as myb, myc, fyn, and the translocation gene bcr/abl, ras, src, P53, neu, trk and 
5 EGRP. Tn addition to oncogene products as target antigens, target polypeptides for 
anti-cancer treatments and protective regimens include variable regions of antibodies 
made by B cell lymphomas and variable regions of T cell receptors of T cell lymphomas 
which, in some embodiments, are also used as target antigens for autoimmune disease. 
Other tumor-associated polypeptides can be used as target polypeptides such as 

10 polypeptides which are found at higher levels in tumor cells including the polypeptide 
recognized by monoclonal antibody 17-lA and folate binding polypeptides. 

Other suitable therapeutic polypeptides and proteins include those which 
may be useful for treating individuals suffering from autoimmune diseases and disorders 
by conferring a broad based protective immune response against targets that are 

15 associated with autoimmunity including cell receptors and cells which produce "self - 
directed antibodies. T cell mediated autoimmune diseases include Rheumatoid arthritis 
(RA), multiple sclerosis (MS), Sjogren's syndrome, sarcoidosis, insulin dependent 
diabetes mellitus (IDDM), autoimmune thyroiditis, reactive arthritis, ankylosing 
spondylitis, scleroderma, polymyositis, dermatomyositis, psoriasis, vasculitis, Wegener's 

20 granulomatosis, Crohn's disease and ulcerative colitis. Each of these diseases is 

characterized by T cell receptors (TCRs) that bind to endogenous antigens and initiate the 
inflammatory cascade associated with autoimmune diseases. 
C. Immunogenic Transgenes 

Suitably, the AAV vectors of the invention avoid the generation of 

25 immune responses to the AAV sequences contained within the vector. However, these 
vectors may nonetheless be formulated in a manner that permits the expression of a 
transgene carried by the vectors to induce an immune response to a selected antigen. For 
example, in order to promote an immune response, the transgene may be expressed from 
a constitutive promoter, the vector can be adjuvanted as described herein, and/or the 

30 vector can be put into degenerating tissue. 

Examples of suitable immunogenic transgenes include those selected from 
a variety of viral families. Examples of desirable viral families against which an immune 
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response would be desirable include, the picornavirus family, which includes the genera 
rhinoviruses, which are responsible for about 50% of cases of the common cold; the 
genera enteroviruses, which include polioviruses, coxsackieviruses, echoviruses, and 
human enteroviruses such as hepatitis A virus; and the genera apthoviruses, which are 
5 responsible for foot and mouth diseases, primarily in non-human animals. Within the 
picornavirus family of viruses, target antigens include the VPl, VP2, VP3, VP4, and 
VPG. Other viral families include the astroviruses and the calcivirus family. The 
calcivirus family encompasses the Norwalk group of viruses, which are an important 
causative agent of epidemic gastroenteritis. Still another viral family desirable for use in 

1 0 targeting antigens for inducing immune responses in humans and non-human animals is 
the togavirus family, which includes the genera alphavirus, which include Sindbis viruses, 
RossRiver virus, and Venezuelan, Eastern & Western Equine encephalitis, and rubivirus, 
including Rubella virus. The flaviviridae family includes dengue, yellow fever, Japanese 
encephalitis, St. Louis encephalitis and tick borne encephalitis viruses. Other target 

1 5 antigens may be generated from the Hepatitis C or the coronavirus family, which includes 
a number of non-human viruses such as infectious bronchitis virus (poultry), porcine 
transmissible gastroenteric virus (pig), porcine hemagglutinatin encephalomyelitis virus 
(pig), feline infectious peritonitis virus (cat), feline enteric coronavirus (cat), canine 
coronavirus (dog), and human respiratory coronaviruses, which may cause the common 

20 cold and/or non-A, B or C hepatitis, and which include the putative cause of sudden acute 
respiratory syndrome (SARS). Within the coronavirus family, target antigens include the 
El (also called M or matrix protein), E2 (also called S or Spike protein), E3 (also called 
HE or hemagglutin-elterose) glycoprotein (not present in all coronaviruses), or N 
(nucleocapsid). Still other antigens may be targeted against the arterivirus family and the 

25 rhabdovirus family. The rhabdovirus family includes the genera vesiculovirus (e.g.. 
Vesicular Stomatitis Virus), and the general lyssavirus (e.g., rabies). Within the 
rhabdovirus family, suitable antigens may be derived from the G protein or the N protein. 
The family filoviridae, which includes hemorrhagic fever viruses such as Marburg and 
Ebola virus may be a suitable source of antigens. The paramyxovirus family includes 

30 parainfluenza Virus Type 1 , parainfluenza Virus Type 3, bovine parainfluenza Virus Type 
3, rubulavirus (mumps virus, parainfluenza Virus Type 2, parainfluenza virus Type 4, 
Newcastle disease virus (chickens), rinderpest, morbillivirus, which includes measles and 
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canine distemper, and pneumovirus, which includes respiratory syncytial virus. The 
influenza virus is classified within the family orthomyxovirus and is a suitable source of 
antigen (e.g., the HA protein, the Nl protein). The bunyavirus family includes the genera 
bunyavirus (California encephalitis, La Crosse), phlebovirus (Rift Valley Fever), 
5 hantavirus (puremala is a hemahagin fever virus), nairovirus (Nairobi sheep disease) and 
various unassigned bungaviruses. The arenavirus family provides a source of antigens 
against LCM and Lassa fever virus. Another source of antigens is the bornavirus family. 
The reovirus family includes the genera reovirus, rotavirus (which causes acute 
gastroenteritis in children), orbiviruses, and cultivirus (Colorado Tick fever, Lebombo 

10 (humans), equine encephalosis, blue tongue). The retrovirus family includes the 

sub-family oncorivirinal which encompasses such human and veterinary diseases as feline 
leukemia virus, HTLVl and HTLVII, lentivirinal (which includes HIV, simian 
immunodeficiency virus, feline immunodeficiency virus, equine infectious anemia virus, 
and spumavirinal). The papovavirus family includes the sub-family polyomaviruses 

1 5 (BKU and JCU viruses) and the sub-family papillomavirus (associated with cancers or 
malignant progression of papilloma). The adenovirus family includes viruses (EX, AD7, 
AR.D, O.B.) which cause respiratory disease and/or enteritis. The parvovirus family 
includes feline parvovirus (feline enteritis), feline panleucopeniavirus, canine parvovirus, 
and porcine parvovirus. The herpesvirus family includes the sub-family 

20 alphaherpesvirinae, which encompasses the genera simplexvirus (HSVI, HS VII), 

varicellovirus (pseudorabies, varicella zoster) and the sub-family betaherpesvirinae, 
which includes the genera cytomegalovirus (HCMV, muromegalovirus) and the 
sub-family gammaherpesvirinae, which includes the genera lymphocryptovirus, EBV 
(Burkitts lymphoma), human herpesviruses 6A, 6B and 7, Kaposi's sarcoma-associated 

25 herpesvirus and cercopithecine herpesvirus (B virus), infectious rhinotracheitis, Marek's 
disease virus, and rhadinovirus. The poxvirus family includes the sub-family 
chordopoxvirinae, which encompasses the genera orthopoxvirus (Variola major 
(Smallpox) and Vaccinia (Cowpox)), parapoxvirus, avipoxvirus, capripoxvirus, 
leporipoxvirus, suipoxvirus, and the sub-family entomopoxvirinae. The hepadnavirus 

30 family includes the Hepatitis B virus. One unclassified virus which may be suitable 
source of antigens is the Hepatitis delta virus. Hepatitis E virus, and prions. Another 
virus which is a source of antigens is Nipan Virus. Still other viral sources may include 
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avian infectious bursal disease virus and porcine respiratory and reproductive syndrome 
virus. The alphavirus family includes equine arteritis virus and various Encephalitis 
viruses. 

The present invention may also encompass immunogens which are useful 
5 to immunize a human or non-human animal against other pathogens including bacteria, 
fungi, parasitic microorganisms or multicellular parasites which infect human and non- 
human vertebrates, or from a cancer cell or tumor cell. Examples of bacterial pathogens 
include pathogenic gram-positive cocci include pneumococci; staphylococci (and the 
toxins produced thereby, e.g., enterotoxin B); and streptococci. Pathogenic 

10 gram-negative cocci include meningococcus; gonococcus. Pathogenic enteric 

gram-negative bacilli include enterobacteriaceae; pseudomonas, acinetobacteria and 
eikenella; melioidosis; salmonella; shigella; haemophilus; moraxella; H. ducreyi (ys/hich 
causes chancroid); brucella species (brucellosis); Francisella lularensis (which causes 
tularemia); Yersinia pestis (plague) and other yersinia (pasteurella); streptobacillus 

15 moniliformis and spirillum; Gram-positive bacilli include listeria monocytogenes; 
erysipelothrix rhusiopathiae; Coryne bacterium diphtheria (diphtheria); cholera; B, 
anthracis (anthrax); donovanosis (granuloma inguinale); and bartonellosis. Diseases 
caused by pathogenic anaerobic bacteria include tetanus; botulism {Clostridum botulinum 
and its toxin); Clostridium perfringens and its epsilon toxin; other Clostridia; 

20 tuberculosis; leprosy; and other mycobacteria. Pathogenic spirochetal diseases include 
syphilis; treponematoses: yaws, pinta and endemic syphilis; and leptospirosis. Other 
infections caused by higher pathogen bacteria and pathogenic fungi include glanders 
(Burkhoideria mallei); actinomycosis; nocardiosis; cryptococcosis, blastomycosis, 
histoplasmosis and coccidioidomycosis; candidiasis, aspergillosis, and mucormycosis; 

25 sporotrichosis; paracoccidiodomycosis, petriellidiosis, torulopsosis, mycetoma and 
chromomycosis; and dermatophytosis. Rickettsial infections include Typhus fever, 
Rocky Mountain spotted fever, Q fever (Coxielia burnetii), and Rickettsialpox. Examples 
of mycoplasma and chlamydial infections include: mycoplasma pneumoniae; 
lymphogranuloma venereum; psittacosis; and perinatal chlamydial infections. 

30 Pathogenic eukaryotes encompass pathogenic protozoans and helminths and infections 
produced thereby include: amebiasis; malaria; leishmaniasis; trypanosomiasis; 
toxoplasmosis; Pneumocystis carinii; Trichans; Toxoplasma gondii; babesiosis; 
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giardiasis; trichinosis; filariasis; schistosomiasis; nematodes; trematodes or flukes; and 
cestode (tapeworm) infections. 

Many of these organisms and/or the toxins produced thereby have been 
identified by the Centers for Disease Control [(CDC), Department of Heath and Human 
5 Services, USA], as agents vs/Wich have potential for use in biological attacks. For example, 
some of these biological agents, include. Bacillus anthracis (anthrax), Clostridium 
botulinum and its toxin (botulism), Yersinia pesiis (plague), variola major (smallpox), 
Francisella tularensis (tularemia), and viral hemorrhagic fevers [filoviruses {e.g., Ebola, 
Marburg], and arenaviruses [e.g., Lassa, Machupo]), all of which are currently classified 

10 as Category A agents; Coxiella burnetti (Q fever); Brucella species (brucellosis), 
Burkholderia mallei (glanders), Burkholderia pseudomallei (meloidosis), Ricinus 
communis and its toxin (ricin toxin), Clostridium perfringens and its toxin (epsilon toxin). 
Staphylococcus species and their toxins (enterotoxin B), Chlamydia psittaci (psittacosis), 
water safety threats {e g-. Vibrio cholerae, Crytosporidium parvum), Typhus fever 

1 5 {Richettsia powazekii), and viral encephalitis (alphaviruses, e.g., Venezuelan equine 

encephalitis; eastern equine encephalitis; western equine encephalitis); all of which are 
currently classified as Category B agents; and Nipan virus and hantaviruses, which are 
currently classified as Category C agents. In addition, other organisms, which are so 
classified or differently classified, may be identified and/or used for such a purpose in the 

20 future. It will be readily understood that the viral vectors and other constructs described 
herein are useful to deliver antigens from these organisms, viruses, their toxins or other 
by-products, which will prevent and/or treat infection or other adverse reactions with 
these biological agents. 

Administration of the vectors of the invention to deliver immunogens 

25 against the variable region of the T cells elicit an immune response including CTLs to 

eliminate those T cells. In rheumatoid arthritis (RA), several specific variable regions of 
TCRs which are involved in the disease have been characterized. These TCRs include 
V-3, V-14, V-17 and V-17, Thus, delivery of a nucleic acid sequence that encodes at 
least one of these polypeptides will elicit an immune response that will target T cells 

30 involved in RA. In multiple sclerosis (MS), several specific variable regions of TCRs 

which are involved in the disease have been characterized. These TCRs include V-7 and 
V-10. Thus, delivery of a nucleic acid sequence that encodes at least one of these 
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15 



20 



pCypeprtdes win eUci, an immune response ,h,. wi,, target T ce„s involved m MS. In 
eJLerma. seve., speciHc varia.ie regions of TCKs w.eH are invoived ,n *e dtsease 
,ave been characterized. These TCRs incinde V-6, V-S. V-,4 and V.,6, VOC V-7, 
V ,4 V 15 V-16 V-28andV.12. Thus, delivery of a nucleic acid molecule that 
encodes a. ieas, one of these polypeptides will elicit an immune response that w,ll target 
T cells involved in scleroderma. 

Thus a rAAV^erived recombinam viral vector of the inventton provides 
an efrtcient gene trarisfer vehicle which can deliver a selected transgene to a selected host 
oel, Vivo or e. v/vo even where the organism has neutralizing antibodies to one or more 
AAV sources. In one embodiment, the rAAV and the cells are mixed « v,Vo, the 
infected cells are cultured using conventional methodologies; and the transduced cells are 

re-infused into the patient. 

These compositions are particularly well suited to gene deh.ery for 
therapeutic purposes and for immunization, including inducing protective immumty. 
Further, the compositions of the invention may also be used for producuon of a des.red 
,ene product ,„ v„.o. For ,„ v,V™ production, a desired product (..g,. a prote.n) may be 
obtained from a desired cultur. following transfection of host cells with a rAAV 
containing the molecule encoding the desired product and culturing the cell culture under 
condrtions which permit expression. The expressed product may then be punfted and 
isolated, as desired. Suiuble techniques for transfection. cell culturing. pur.ftcauon. and 
isolation are known to those of skill in the art. 

The following examples illustrate several aspects and embodiments of the 

invention. 

EXAMPLE 1 - Computational analysis of primate AAV sequences 

A CoUeclion of primale (issues 

Sources of nonhuman primate tissues were described previously [N. 
Muzyczka, K. 1. Berns, in FieMs V.olo^ D. M. Knipe. P. M. Howley. Eds. (Lippincott 
Williams & Wilkins, Philadelphia, 2001), vol. 2. pp. 2327-23591. Human tissues were 
collected from either surgical procedures or postmortem examination or organ dorrors 
through two major national human tissue providers. Cooperative Human Tissue Netwo* 
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(CHTN) and National Disease Research Interchange (NDRT), Human tissues used for this 
study were comprised of 1 8 different tissue types that included colon, liver, lung, spleen, 
kidney, brain, small bowel, bone marrow, heart, lymph nodes, skeletal muscle, ovary, 
pancreas, stomach, esophagus, cervix, testis and prostate. The tissue samples came from a 
5 diverse group of individuals of different gender, races (Caucasian, African-American, 
Asian and Hispanic) and ages (23 - 83 years). Among 259 samples from 250 individuals 
analyzed, approximately 28% of tissues were associated with pathology. 
B. Detection and isolation of AAV sequences 

Total cellular DNAs were extracted from human and nonhuman primate 
10 tissues as described previously [R. W. Atchison, et al., Science 194, 754-756 (1965)]. 
Molecular prevalence and tissue distribution of AAVs in humans were determined by 
either signature or full-length cap PGR using the primers and conditions that were similar 
to those used for the nonhuman primate analysis. The same PGR cloning strategy used for 
the isolation and characterization of an expanded family of AAVs in nonhuman primates 

15 was deployed in the isolation of AAVs from selected human tissues. Briefly, a 3.1 kb 
fragment containing a part of rep and full length cap sequence was amplified from tissue 
DNAs by PGR and Topo-cloned (Invitrogen). The human AAV clones were initially 
analyzed by restriction mapping to help identify diversity of AAV sequences, which were 
subsequently subjected to full sequence analysis by Seq Wright (Seq Wright, Houston, TX) 

20 with an accuracy of 99.9%. A total of 67 capsid clones isolated from human tissues were 
characterized (hu.l - hu.67). From nonhuman primate tissues, 86 cap clones were 
sequenced, among which 70 clones were from rhesus macaques, 6 clones from 
cynomologus macaques, 3 clones from pigtailed macaques, 2 clones from a baboon and 5 
clones from a chimpanzee. 

25 C. Analysis of AAV sequences 

From all contiguous sequences, AAV capsid viral protein (vpl) open 
reading frames (ORFs) were analyzed. The AAV capsid VPl protein sequences were 
aligned with the ClustalXl .81™ program [H. D. Mayor, J. L. Melnick, Nature 210, 331- 
332 (1966)] and an in-frame DNA alignment was produced with the BioEdit'^'^ [U. 

30 Bantel-Schaal, H. Zur Hausen, Virology 134, 52-63 (1984)] software package. 

Phylogenies were inferred with the MEGA™ v2.1 and the TreePuzzle™ package, 
Neighbor-Joining, Maximum Parsimony , and Maximum Likelihood [M. Nei, S. Kumar, 
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Molecular Evolution and Phylogenetics (Oxford University Press, New York, 2000); H. 
A. Schmidt, K. Strimmer, M. Vingron, A. von Haeseler, Bioinformatics 18, 502-4 (Mar, 
2002); N. Saitou, M. Nei, Mol Biol Evol 4, 406-25 (Jul, 1 987)] algorithms were used to 
confirm similar clustering of sequences in monophylic groups. 
5 Clades were then defined from a Neighbor-Joining phylogenetic tree of all 

protein sequences. The amino-acid distances were estimated by making use of Poisson- 
correction. Bootstrap analysis was performed with a 1000 replicates. Sequences were 
considered monophylic when they had a connecting node within a 0.05 genetic distance. 
A group of sequences originating from 3 or more sources was considered a clade. The 
10 phylogeny of AAV was further evaluated for evidence of recombination through a 
sequential analysis. Homoplasy was screened for by implementation of the Split 
Decomposition algorithm [H. J. Bandelt, A. W. Dress, Mol Phylogenet Evol 1, 242-52 
(Sep. 1992)]. Splits that were picked up in this manner were then further analyzed for 
recombination making use of the Bootscan algorithm in the Simplot software [M. Nei and 
1 5 S. Kumar, Molecular Evolution and Phylogenetics (Oxford University Press, New York, 
2000)]. A sliding window of 400nt (lOnt/step) was used to obtain 100 bootstrap replicate 
neighbor-joining trees. Subsequently, Split Decomposition and Neighbor-Joining 
phylogenies were inferred from the putative recombination fragments. Significant 
improvement of bootstrap values, reduction of splits and regrouping of the hybrid 
20 sequences with their parental sources were considered the criterion for recombination. 

A number of different cap sequences amplified from 8 different human 
subjects showed phylogenetic relationships to AAV2 (5') and AAV3 (3') around a 
common breakpoint at position 1400 of the Cap DNA sequence, consistent with 
recombination and the formation of a hybrid virus. This is the general region of the cap 
25 gene where recombination was detected from isolates from a mesenteric lymph node of a 
rhesus macaque [Gao etaL, Proc Natl Acad Sci USA 100, 6081-6086 (May 13, 2002)]. 
An overall codon based Z- test for selection was performed implementing the Neib- 
Gojobori method [R. M. Kotin, Hum Gene TherS, 793-801 (Jul, 1994)]. 

The phylogenetic analyses were repeated excluding the clones that were 
30 positively identified as hybrids. In this analysis, goose and avian AAVs were included as 
outgroups[(l.Bossis, J. A. Chiorini,y 77, 6799-810 (Jun. 2003)]. Figure 1 is a 
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10 



neighbor-joining tree; similar relationships were obtained using maximum parsimony and 
maximum likelihood analyses. 

This analysis demonstrated 1 1 phylogenetic groups, which are 
summarized in Table I . The species origin of the 6 AAV clades and 5 individual AAV 
clones (or sets of clones) is represented by the number or sources from which the 
sequences were retrieved in the sampling. The total number of sequences gathered per 
species and per grouping is shown in between brackets. References for previously 
described sequences per clade are in the right column. Rhesus - rhesus macaques; cyno - 
cynomologus macaques; chimp - chimpanzees; pigtail - pigtail macaques. 

Table 1 

Classification of the number of sources (sequences) per species and per clade or clone 

Human Rhesus Cyno Baboon Chimp Pigtail 



C lade/representative 

A/AAV1(AAV6) 3(4) 

B/AAV2 12(22) 

C/AAV2-AAV3 8(17) 
hybrid 

D/AAV7 5(10) 5(5) 

E/AAV8 7(9) 7(16) 1(2) 1(3) 

F / AAV9 3(3) 

Clones 



AAV3 

AAV4 1(3) 
AAV5 

Ch.5 

Rh.8 2(2) 



1(1) 



Since, as noted above, recombination is not implemented in the standard 
1 5 phylogenetic algorithms used, in order to build a proper phylogenetic tree, those 

sequences were excluded from the analysis, of which their recombinative ancestry was 
established. A neighbor-joining analysis of all non-recombined sequences is represented 
side by side with the clades that did evolve making use of recombination. A similar 
output was generated with the different algorithm used and with the nucleotide sequence 
20 as input. 
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Additional experiments were performed to evaluate the relationship of 
phylogenetic relatedness to function as measured by serologic activity and tropism, as 
described in the following examples. 

5 EXAMPLE 2 - Serological analysis of novel human AAVs 

The last clade obtained as described in the preceding example was derived 
from isolates of 3 humans and did not contain a previously described serotype. 
Polyclonal antisera were generated against a representative member of this clade and a 
comprehensive study of serologic cross reactivity between the previously described 

10 serotypes was performed. This showed that the new human clade is serologically distinct 
from the other known serotypes and therefore is called Clade F (represented by AA V9). 

Rabbit polyclonal antibodies against AAV serotypes 1-9 were generated 
by intramuscularly inoculating the animals with 1 x lO'^ genome copies each of AAV 
vectors together with an equal volume of incomplete Freud's adjuvant. The injections 

1 5 were repeated at day 34 to boost antibody titers. Serological cross reactivity between 
AAV 1-9 was determined by assessing the inhibitory effect of rabbit antisera on 
transduction of 293 cells by vectors carrying a reporter gene (AAVCMVEGFP, which 
carries enhanced green fluorescent protein) pseudotyped with capsids derived from 
different AAV sources. Transduction of 84-31 cells by AAVCMVEGFP vectors was 

20 assessed under a UV microscope. In assessing serologic relationships between two 

AAVs, the ability of both heterologous and homologous sera to neutralize vectors from 
each AAV were tested. If neutralization by the serum was at least 16-fold lower against 
heterologous vectors than homologous vectors in a reciprocal manner, the two AAVs are 
considered distinct serotypes. Neutralization titers were defined as described previously 

25 [(G. P. Gao etai, Proc Natl Acad Sci USA 99, 1 1854-9 (Sep 3, 2002)]. 
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Table 2 

Serologic evaluation of novel AAV vectors 



Vector pseudotypes used in the neutralization assay 



from rabbit 
immunized 
with; 



AAV2/1 



AAV2/2 



AAV2/3 
AAV2/4 



AAV2/5 



AAV2/1 



1/ 

163,840 



1/80 



1/1,280 



1/20 



AAV2/6 



AAV2/7 



AAV2/8 



1/ 

20.480 



AAV2/2 



No NAB 



1/81,920 



1/2,560 



No NAB 



1/ 

81,920 



1/1.280 



1/20 



No NAB 



No NAB 



1/640 



1/1,280 



No NAB 



AAV2/3 



No NAB 



1/5,120 



1/40,960 



No NAB 



1/80 



1/640 



1/1.280 



1/1,280 



No NAB 



AAV2/4 



No NAB 



1/20 



1/20 



l/ly280 



No NAB 



1/40 



1/20 



No NAB 



No NAB 



AAV2/5 



1/40,960 



No NAB 



1/40 
1/40 



1/ 

163,840 



1/40 



No NAB 



1/20 



NoNAB 



AAV2/6 



1/40.960 
1/80 



1/2,560 



No NAB 



1/5,120 



1/327,680, 



1/1,280 



NoNAB 



NoNAB 



AAV2/7 



1/40 
1/40 



1/1.280 



No NAB 



1/40 



1/40 



1/640 



1/20 



AAV2/8 



NoNAB 



1/40 



1/1.280 



NoNAB 



AAV2/9 



NoNAB 



NoNAB 



NoNAB 



1/40 



NoNAB 



No NAB 



1/5,120 1/80 



li.§27,68 



NoNAB 



1/40 



1/640 



1/2,560 



15 



20 



These data confirm the phylogenetic groupings of the different clones and 
clades except for unanticipated serological reactivity of the structurally distinct AAV5 
and AAVl serotypes (i.e., ratio of heterologous/homologous titer were 1/4 and 1/8 in 

reciprocal titrations). 

The result further indicated thatAAVhu.14 had a distinct serological 
property and did not have significant cross reactivity with antisera generated from any 
known AAV serotypes. The serological distinctiveness of AAVhu.14 was further 
supported by its uniqueness in the capsid structure which shared less than 85% amino 
acid sequence identity with all other AAV serotypes compared in this study. Those 
findings provided the basis for us to name AAVhu.14 as a new serotype. AAV9. 

EXAMPLE 3 - Evaluation of primate AAVs as gene transfer vectors 

The biological tropisms of AAVs were studied by generating vector pseudotyped 
in which recombinant AAV2 genomes expressing either GFP or the secreted reporter 
gene a-1 antitrypsin (A1 AT) were packaged with capsids derived from various clones and 
one representative member from each primate AAV clade for comparison. For instance, 
the data obtained from AAVl was used to represent Clade A, followed by AAV2 for 
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Clade B, Rh.34 for AAV4, AAV7 for Clade D, AAV8 for Clade E, and AAVHu.I4 for 
Clade F. AAV5, AAVCh.5 and AAVRh.8 stand as single AAV genotypes for the 
comparison. 

The vectors were evaluated for transduction efficiency in vitro, based on GFP 
5 transduction, and transduction efficiency in vivo in liver, muscle or lung (Fig. 4). 

A. In Vitro 

Vectors expressing enhanced green fluorescent protein (EGFP) v^ere used 
to examine their in vitro transduction efficiency in 84-31 cells and to study their 
serological properties. For functional analysis, in vitro transduction of different 

10 AAVCMVEGFP vectors was measured in 84-31 cells that were seeded in a 96 well plate 
and infected with pseudotyped AAVCMVEGFP vectors at an MOI of 1 x 10^ GC per 
cell. AAV vectors were pseudotyped with capsids of AAVs 1, 2, 5, 7, 8 and 6 other 
novel AAVs (Ch.5, Rh.34, Cy5, rh.20, Rh.8 and AAV9) using the technique described in 
G. Gao etal., Proc Natl Acad Sci USA 99, 1 1854-9 (Sep 3, 2002). Relative EGFP 

1 5 transduction efficiency was scored as 0, 1,2 and 3 corresponding to 0-10%, 10-30%, 30- 
70% and 70-100% of green cells estimated using a UV microscope at 48 hours post 
infection. 

B. In Vivo 

For in vivo studies, human a-antitrypsin (Al AT) was selected as a 
20 sensitive and quantitative reporter gene in the vectors and expressed under the control of 
CMV-enhanced chicken P-actin promoter. Employment of the CB promoter enables high 
levels of tissue non-specific and constitutive Al AT gene transfer to be achieved and also 
permits use of the same vector preparation for gene transfer studies in any tissue of 
interest. Four to six week old NCR nude mice were treated with novel AAV vectors 
25 (AAVCBhAlAT)atadoseof IxlO'^ genome copies per animal through intraportal, 
intratracheal and intramuscular injections for liver, lung and muscle directed gene 
transfer, respectively. Serum samples were collected at different time points post gene 
transfer and Al AT concentrations were determined by an ELISA-based assay and scored 
as 0, 1,2 and 3 relative to different serum Al AT levels at day 28 post gene transfer, 
30 depending on the route of vector administration (Liver: 0 = AlAT <400 ng/ml, 1 = 

Al AT 400-1000 ng/ml, 2 = AlAT 1000-10,000 ng/ml, 3 - AlAT > 10,000 ng/ml; Lung: 
0 = AlAT < 200 ng/ml, 1 = AlAT 200-1000 ng/ml, 2 = AlAT 1000-10,000 ng/ml, 3 = 
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15 



MAT > 10,000 ng/ml; Muscle: 0 = MAT < 100 ng/ml, 1 = AlAT 100-1000 ng/ml, 2 = 
AlAT 1000-10,000 ng/ml, 3 = Al AT > 10,000 ng/ml). 

A human AAV, clone 28.4/hu.l4 (now named AAV9), has the ability to transduce 
liver at a efficiency similar to AAV8. lung 2 logs better than AAV5 and muscle superior 
to AAVl whereas the performance of two other human clones, 24.5 and 16.12 (hu.l2 
and hu.l3) was marginal in all 3 target tissues. Clone N721.8 (AAVrh.43) is also a high 

performer in all three tissues. 

To further analyze gene transfer efficiency of AAV9 and rh 43 in comparison 
with that of bench markers for liver (AAV8), lung (AAV5) and muscle (AAV 1 ), a dose 
response experiment was carried out. Both new vectors demonstrated at least 10 fold 
more gene transfer than AAVl in muscle, similar performance to AAV8 in liver and 2 
logs more efficient than AAV5 in lung. 

A group of AAVs demonstrated efficient gene transfer in all 3 tissues that was 
similar or superior to the performance of their bench marker in each tissue has emerged. 
To date, 3 novel AAVs have fallen into this category, two from rhesus (rhlO and 43) and 
one from human (hu. 1 4 or AA V9). A direct comparison of relative gene transfer 
efficiency of those 3 AAVs to their bench markers in the murine liver, lung and muscle 
suggests that some primate AAVs with the best fitness might have evolved from rigorous 
biological selection and evolufion as "super" viruses. These are particularly well suited 
20 for gene transfer applications. 

C. Profiles of Biological Activity 

Unique profiles of biological activity, in terms of efficiency of gene 
transfer, were demonstrated for the different AAVs with substantial concordance within 
members of a set of clones or clade. However, in vitro transduction did not predict the 
25 efficiency of gene transfer in v,vo. An algorithm for comparing the biological activity 

between two different AAV pseudotypes was developed based on relative scoring of the 
level of transgene expression and a cumulative analysis of differences. 

Cumulative differences of the gene transfer scores in vitro and in vivo 
between pairs of AAVs were calculated and presented in the table (ND = not determined) 
30 according to the following formula. Cumulative functional difference in terms of scores 
between vectors A and B = in vitro (A - B) + lung (A - B) + liver (A - B) muscle (A - 
B) The smaller the number, the more similar in function the AAVs. In the grey shaded 
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area, the percentage difference in sequence is represented in bold italic. The percentage 
difference in cap structure was determined by dividing the number of amino-acid 
differences after a pairwise deletion of gaps by 750, the length of the VPl protein 
sequence alignment. 
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These studies point out a number of issues relevant to the study of 
parvoviruses in humans. The prevalence of endogenous AAV sequences in a wide array 
of human tissues suggests that natural infections with this group of viruses are quite 
common. The wide tissue distribution of viral sequences and the frequent detection in 

10 liver, spleen and gut indicate that transmission occurs via the gastrointestinal track and 
that viremia may be a feature of the infection. 

The tremendous diversity of sequence present in both human and 
nonhuman primates has functional correlates in terms of tropism and serology, suggesting 
it is driven by real biological pressures such as immune escape. Clearly, recombination 

15 contributes to this diversity as evidenced by the second most common human clade, 
which is a hybrid of two previously described AAVs. 

Inspection of the topology of the phylogenetic analysis reveals insight into 
the relationship between the evolution of the virus and its host restriction. The entire 
genus of dependoviruses appears to be derived from avian AAV consistent with 

20 Lukashov and Goudsmit [(V. V. Lukashov, J. Goudsmit, J Virol 75, 2729-40 (Mar, 

2001)]. The AAV4 and AAV5 isolates diverged early from the subsequent development 
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of the other AAVs. The next important node divides the species into two major 
monophilic groups. The first group contains clones isolated solely from humans and 
includes Clade B, AAV3 clone, Clade C and Clade A; the only exception to the species 
restriction of this group is the single clone from chimpanzees, called ch.5. The other 
5 monophilic group, representing the remaining members of the genus, is derived from both 
human and nonhuman primates. This group includes Clade D and the rh.8 clone, which 
were isolated exclusively from macaques, and the Clade F, which is human specific. The 
remaining clade within this group (/.e., Clade E) has members from both humans and a 
number of nonhuman primate species suggesting transmission of this clade across species 

10 barriers. It is interesting that the capsid structures of Clade E members isolated from 

some humans are essentially identical to some from nonhuman primates, indicating that 
very little host adaptation has occurred. Analysis of the biology of AAVS derived vectors 
demonstrated a broad range of tissue tropism with high levels of gene transfer, which is 
consistent with a more promiscuous range of infectivity, and may explain its apparent 

1 5 zoonosis. An even greater range and efficiency of gene transfer was noted for the Clade 
F, highlighting the potential for cross species transmission, which to date has not been 
detected. 

The presence of latent AAVs widely disseminated throughout human and 
nonhuman primates and their apparent predisposition to recombine and to cross species 

20 barriers raises important issues. This combination of events has the potential to lead to 
the emergence of new infectious agents with modified virulence. Assessing this potential 
is confounded by the fact that the clinical sequalae of AAV infections in primates has yet 
to be defined. In addition, the high prevalence of AAV sequences in liver may contribute 
to dissemination of the virus in the human population in the setting of allogeneic and 

25 xenogenic liver transplantation. Finally, the finding of endogenous AAVs in humans has 
implications in the use of AAV for human gene therapy. The fact that wild type AAV is 
so prevalent in primates without ever being associated with a malignancy suggests it is 
not particularly oncogenic. In fact, expression of AAV rep genes has been shown to 
suppress transformation P. L. Hermonat, Virology 111, 253-61 (Sep, 1989)]. 
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EXAMPLE 4 - AAV 2/9 Vector for the Treatment of Cystic Fibrosis Airway 
Disease 

To date, CFTR gene transfer to the lung for the treatment of CF airway disease 
has been limited by poor vector performance combined with the significant barriers that 
5 the airway epithelium poses to effective gene transfer. The AAV2 genome packaged in 
the AA V9 capsid (AA V2/9) was compared to AAV2/5 in various airway model systems. 

A 50 ^l single dose of 1 x lO'' genome copies (gc) of AAV2/9 expressing either 
the nuclear targeted p-galactosidase (nLacZ) gene or the green fluorescence protein 
(GPP) gene under the transcriptional control of the chicken p-actin promoter was instilled 

1 0 intranasal ly into nude and also C57B1/6 mice. Twenty-one days later, the lung and nose 
were processed for gene expression. In control animals transduced with AAV2/9-GFP, 
no LacZ positive cells were seen, AAV2/9-nLacZ successfully transduced mainly 
airways, whereas AAV2/5-nLacZ transduced mainly alveoli and few airways. Across the 
nasal airway epithelium, both AAV2/5 and AAV2/9 transduced ciliated and non-ciliated 

1 5 epithelial cells. 

Epithelial cell specific promoters are currently being evaluated to improve 
targeting to the airway cells in vivo. Based on the in vivo findings, the gene transfer 
efficiency of AAV2/9 to human airway epithelial cells was tested next. Airway epithelial 
cells were isolated from human trachea and bronchi and grown at air-liquid-interface 

20 (ALl) on collagen coated membrane supports. Once the cells polarized and differentiated, 
they were transduced with AAV2/9 or AAV2/5 expressing GFP from the apical as well as 
the basolateral side. Both AAV2/5 and AAV2/9 were successful at transducing epithelial 
cells from the basolateral surface. However, when applied onto the apical surface 
AAV2/9 resulted in a 1 0-fold increase in the number of transduced cells compared to 

25 AA V2/5. Currently, the gene transfer performance of AAV2/9 in the lungs and nasal 
airways of nonhuman primates is being evaluated. 

This experiment demonstrates that AAV2/9 can efficiently transduce the airways 
of murine lung and well-differentiated human airway epithelial cells grown at ALL 
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EXAMPLE 5 - Comparison of direct injection of AAV1(2/1) and AAV9(2/9) in 
adult rat hearts 

Two adult (3 month old) rats received a single injection of 5x1 0 particles of 
AAV2/1 or AAV2/9 in the left ventricle 

The results were spectacular, with significantly more expression observed m the 
adult rat heart with A AV2/9 vectors as compared to AAV2/1 , as assessed by lacZ 
histochemistry. AAV2/9 also shows superior gene transfer in neonatal mouse heart. 

EXAMPLE 6 - AAV2/9 Vector for Hemophilia B Gene Therapy 

In this study, AAV 2/9 vectors are shown to be more efficient and less 
immunogenic vectors for both liver and muscle-directed gene therapy for hemophilia B 

than the traditional AAV sources. 

For a liver-directed approach, evaluation of the AAV2/9 pseudotyped vector was 
performed in mouse and dog hemophilic models. In immunocompetent hemophilia B 
mice (in C57BL/6 background), long-term superphysiological levels of canine Factor TX 
(cFIX 41-70 ^g/ml) and shortened activated partial thromboplastin time (aPTT) have 
been achieved following intraportal injection of IxlO" genome copies (GC)/mouse of 
AAV2/7 2/8, and 2/9 vectors in which the cFlX is expressed under a liver specific 
promoter (LSP) and woodchuck hepatitis B post-transcriptional responsive element 
(WPRE) A lO-fold lower dose(lxlO'°GC/mouse)ofAAV2/8 vector generated normal 
level of CFIX and aPTT time. In University of North Caroline (UNC) hemophilia B dogs, 
it was previously demonstrated that administration of an AAV2/8 vector into a dog 
previously treated with an AAV2 vector was successful; cFlX expression peaked at 10 
ng/ml day 6 afterthe 2"" intraportal injection (dose=5xlO- GC/kg), then gradually 
5 decreased and stabilized around 700ng/ml (16% of the normal level) throughout the study 
(1 1/2 years). This level was about 3-fold higher than that from a hemophiha B dog that 
received a single injection of AAV2-cFlX at the similar dose. Recently, two naYve 
hemophilia B dogs were injected with AAV2/8 vectors intraportally at the dose of 
5 25x10'^ GC/kg cFlX levels in one dog (male) reached 30% of normal level (1 .5 
50 ug/ml) ten weeks after injection and has sustained at 1 .3-1 .5 ^g/ml, while the second dog 
(female) maintained cFlX expression at about 1 0% of normal level. Whole blood clottmg 
time (WBCT) and aPTT were both shortened after the injection, suggesting the ant.gen 
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was biologically active. Liver enzymes (aspartate amino transferase (SCOT), alanine 
amino transferase (SGPT) in both dogs remained in the normal range after surgery. 
These AAV were also evaluated for muscle-targeted gene therapy of hemophilia B. 
AAV-CMV-cFIX-WPRE [an AAV carrying cFIX under the control of a CMV promoter 
5 and containing the WPRE] packaged with six different AAV sources were compared in 
immunocompetent hemophilia B mice (in C57BL/6 background) after intramuscular 
injection at the dose of 1x1 0^' GC/mouse. cFJX gene expression and antibody formation 
were monitored. Highest expression was detected in the plasma of the mice injected with 
AAV2/8 vectors (1460+392 ng/ml at day 42), followed by AAV2/9 (773+1 71 ng/ml at 
0 day 42) and AA V2/7 (500±3 I 1 ng/ml at day 42). Levels were maintained for 5 months. 
Surprisingly, cFIX expression by AAV2/1 ranged from 0-253 ng/ml (average: 66±82 
ng/ml). Anti-cFIX inhibitor (IgG) was detected in some of the AAV2/] -injected mice. 
cFIX expression levels in these mice correlated well with inhibitor levels. Further 
screening of inhibitor formation was performed on day 28 samples for all AAV. 
5 Hemophilia B mice showed highest inhibitor formation against AAV2/2, followed by 
AAV2/5, and AAV2/1. Only sporadic and low level inhibitors were detected in animals 
injected with AAV2/7, AAV2/8 and AAV2/9. Thus, the advantages of the new AAV 
serotype 2/9 vectors for muscle-directed gene therapy for hemophilia B as more efficient 
and safe vectors without eliciting any significant anti-FIX antibody formation are shown. 

EXAMPLE 7 - Novel Rh.43 Vectors of Invention 

A . Comparison ofAA Vrh. 43 based A J AT expression vector with A A V8 
and AA V9 in mouse liver directed gene transfer 

Novel AAVrh.43, which belongs to Clade E by phylogenetic analysis 
vector was compared to AAV8 and novel AAV9 for hAl AT levels after intraportal 
infusion to the mouse liver. More particularly, pseudotyped AAVrh.43, AAV2/8 and 
AAV2/9 vectors were compared in mouse liver-directed gene transfer. Pseudotyped 
vectors at doses of 1 xlO" GC, 3x10^^ GC and 1x1 0'^ GC per animal were administrated 
to 4 - 6 week old C57BL/6 mouse intramuscularly. Serum samples were collected from 
animals at day 28 post vector infusion for the human alpha 1 anti-trypsin (hAl AT) assay. 

The data indicated that the novel AAVrh.43 vector had indeed a 
performance similar to that of AAV9 in the mouse model. 
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B. Nuclear target LacZ gene transfer to mouse liver and muscle mediated 
by pseudotyped A A V vectors. 

Novel AAV9 and AAVrh.43 based vectors of the invention were 
compared to AAVl and AAV2-based vector. The vectors were injected at a dose of 
5 IxlO" GC per mouse either intraportally to target liver or intramuscularly to the right 
anterior tibialis muscle of C57BL/6 mice intramuscularly. The animals were sacrificed at 
day 28 post gene transfer and tissues of interest harvested for X-gal histochemical 
staining. 

The AAVrh.43 vector demonstrated gene transfer efficiency that was 
10 close to AAV9 but at least 5 fold higher than AAVl. The property of AAVrh.43 was 

further analyzed in both liver and muscle using nuclear targeted LacZ gene as a reporter 

to visualize extend of gene transfer histochemically. 

C Comparison ofAA Vrh 43 based A J AT expression vector with AA V5 in 

mouse lung directed gene transfer 
1 5 A novel rh.43-based vector of the invention also demonstrated superb 

gene transfer potency in lung tissue. Different doses (lxlO'°, 3x10^^ and 1x1 0'^ GC per 

animal) of pseudotyped vectors were administrated to 4 - 6 week old C57BL/6 mouse 

lungs intratracheally. Serum samples were collected from animals at different time points 

for hAl AT assay. 

20 This vector was compared to AAV5 at different doses for levels of 

hAl AT detected systematically after intratracheal instillation to the mouse lung. The data 
indicated that this novel vector was at lease 100 fold more efficient than AAV5 in the 
mouse model. 

EXAMPLE 8 - Novel human AAV based vectors in mouse models for liver and 
lung-directed gene transfer 

The human clones, AAVhu.37, AAVhu.41 and AAVhu.47 were pseudotyped and 
examined for gene transfer potency in mouse tissues. AAVCBAl AT vectors 
pseudotyped with capsids of hu.37, hu.41 and hu.47 were prepared using the methods 
described herein and administrated to 4 - 6 week old C57BL/6 mouse through intraportal 
and intratracheal injections. Serum samples were collected from animals at day 14 post 
vector injection for hAl AT assay, which was performed in accordance with published 
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techniques. AAVhu.47 belongs to AAV2 family (clade B) AAV2 and was isolated from 
a human bone marrow sample. AAVhu.37 and AAVhu.41 came from a human testis 
tissue and a human bone marrow sample respectively. Phylogenetically, they fall into the 
AAV 8 clade (clade E). 
5 Serum Al AT analysis of injected animals indicated that AAV hu.4I and AAV 

hu.47 performed poorly in the three tissues tested. However, gene transfer potency of 
AAVhu.37 derived vector was similar to that of AAV8 in liver and AAV9 in lung 

10 All publications cited in this specification are incorporated herein by 

reference. While the invention has been described with reference to particularly preferred 
embodiments, it will be appreciated that modifications can be made without departing 
from the spirit of the invention. 
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